CONSTRUCTION OF THE GREEN'S FUNCTION
OF A LINEAR DIFFERENTIAL SYSTEM


Kenneth S. Miller 


FOREWORD 


The classical Green’s function for a linear differential system
is generally defined as a function of two variables with certain
prescribed properties. In this paper we show how the concept of Green’s
function arises in a natural fashion from the familiar method of solving
a non-homogeneous differential equation by variation of
parameters. The link between the two approaches is the one-sided Green’s
function, which has many useful properties in its own right.


1. Introduction 

The Green’s function is undoubtedly the most powerful tool that 
exists for investigating properties of ordinary linear differential 
systems. One of the elementary properties of this function is that it 
enables us to solve a non-homogeneous linear differential equation. 
That is, if we have a linear differential equation 


$$Lu = p_0(x)u^{(n)} + p_1(x)u^{(n-1)} + \cdots + p_n(x)u$$


(where the coefficients are continuous in some closed finite interval
[a,b] and $p_0(x) > 0$ in [a,b]) and a set of boundary conditions

$$U_\alpha(u) = \sum_{i=1}^{n} A_{\alpha i}\,u^{(i-1)}(a) + \sum_{i=1}^{n} B_{\alpha i}\,u^{(i-1)}(b), \qquad \alpha = 1, 2, \ldots, n$$

(such that the completely homogeneous system Lu = 0, $U_\alpha(u) = 0$,
$\alpha = 1, 2, \ldots, n$ is incompatible) then u(x) where

$$u(x) = \int_a^b G(x,\zeta)\,r(\zeta)\,d\zeta \qquad (1)$$

is a solution of the non-homogeneous differential equation
Lu(x) = r(x)
which satisfies the boundary conditions
$U_\alpha(u) = 0$, $\alpha = 1, 2, \ldots, n$.

The function $G(x,\zeta)$ which appears in Eq. (1) is called the Green’s
function of the system.
Another more elementary method for solving a linear differential
equation is the classical method of variation of parameters. It would
therefore appear that there should be an intimate relation between
the two. This is indeed the case and we shall show how the concept of
Green’s function can be arrived at starting with the method of variation
of parameters. In the course of our work certain interesting results
will be obtained, namely, the idea of a “one-sided Green’s function”
which has many practical applications in itself (e.g., in the theory of
networks and feed-back circuits).

In most treatments, the Green’s function is defined as a function
of two variables which has a discontinuity in its (n - 1)st derivative
with a finite jump of $1/p_0(\zeta)$ and formally satisfies Lu = 0 and $U_\alpha(u) = 0$
except at the point $x = \zeta$. It is later shown that the function so
defined has the desired properties, e.g., the result exhibited in
Eq. (1). However, the development we are about to undertake appears to
be a more natural argument in that it gives us a clear picture as to
how the concept of Green’s function arises logically from the familiar
method of variation of parameters.


2. The One-sided Green’s Function 
Consider, for simplicity, a linear differential equation of the 
second order, 


$$Lu = p_0(x)u'' + p_1(x)u' + p_2(x)u \qquad (2)$$

where the coefficients $p_0$, $p_1$, $p_2$ are continuous in some closed finite
interval [a,b] and $p_0(x) > 0$ in this interval. We shall look for a
solution u(x) of the non-homogeneous equation

$$Lu = r(x)$$

(where r(x) is continuous in [a,b]) which satisfies the homogeneous
boundary conditions

$$u(a) = 0 \quad \text{and} \quad u'(a) = 0.$$


The method of variation of parameters tells us to assume a solution of
the form

$$u(x) = v_1(x)\phi_1(x) + v_2(x)\phi_2(x)$$

where $\phi_1(x)$ and $\phi_2(x)$ are any two linearly independent solutions of
Lu = 0, and $v_1(x)$ and $v_2(x)$ are to be determined. Now,

$$u'(x) = v_1\phi_1' + v_2\phi_2'$$

and

$$u''(x) = v_1\phi_1'' + v_2\phi_2'' + r/p_0$$

where we have imposed the two conditions

$$v_1'\phi_1 + v_2'\phi_2 = 0$$
$$v_1'\phi_1' + v_2'\phi_2' = r/p_0 \qquad (3)$$

on the functions $v_1$ and $v_2$. Forming Lu,

$$Lu = p_0(v_1\phi_1'' + v_2\phi_2'') + r + p_1(v_1\phi_1' + v_2\phi_2') + p_2(v_1\phi_1 + v_2\phi_2) = v_1(L\phi_1) + v_2(L\phi_2) + r = r$$

since $\phi_1$ and $\phi_2$ are solutions of Lu = 0.
$v_1(x)$ and $v_2(x)$ can be readily obtained from solving Eqs. (3).
The determinant of these equations is

$$\begin{vmatrix} \phi_1(x) & \phi_2(x) \\ \phi_1'(x) & \phi_2'(x) \end{vmatrix}$$

which is precisely the Wronskian, W(x), of the linear differential
operator of Eq. (2) and hence is never zero in [a,b]. Solving these
simultaneous linear equations yields

$$v_1' = \frac{-r(x)\phi_2(x)}{W(x)p_0(x)}, \qquad v_2' = \frac{r(x)\phi_1(x)}{W(x)p_0(x)}$$

and by integrating,

$$v_1 = -\int_a^x \frac{r(\zeta)\phi_2(\zeta)}{p_0(\zeta)W(\zeta)}\,d\zeta + A$$
$$v_2 = \int_a^x \frac{r(\zeta)\phi_1(\zeta)}{p_0(\zeta)W(\zeta)}\,d\zeta + B.$$

Hence the solution u(x) of Lu = r(x) can be written:

$$u(x) = -\phi_1(x)\int_a^x \frac{r(\zeta)\phi_2(\zeta)}{p_0(\zeta)W(\zeta)}\,d\zeta + \phi_2(x)\int_a^x \frac{r(\zeta)\phi_1(\zeta)}{p_0(\zeta)W(\zeta)}\,d\zeta + A\phi_1(x) + B\phi_2(x)$$

which is the general solution of Lu = r. If now we combine the two
integrals we have

$$u(x) = \int_a^x H(x,\zeta)r(\zeta)\,d\zeta + A\phi_1(x) + B\phi_2(x) \qquad (4)$$

where

$$H(x,\zeta) = \frac{-1}{p_0(\zeta)W(\zeta)} \begin{vmatrix} \phi_1(x) & \phi_2(x) \\ \phi_1(\zeta) & \phi_2(\zeta) \end{vmatrix} \qquad (5)$$


Obviously, u(x) defined by Eq. (4) has the property that Lu(x) = r(x).
We must now find the constants A and B such that u(x) satisfies the
prescribed boundary conditions, namely, u(a) = 0 and u'(a) = 0.

$$u(a) = \int_a^a H(a,\zeta)r(\zeta)\,d\zeta + A\phi_1(a) + B\phi_2(a) = 0 + A\phi_1(a) + B\phi_2(a) = 0 \qquad (6)$$

$$u'(x) = H(x,x)r(x) + \int_a^x \frac{\partial H}{\partial x}(x,\zeta)r(\zeta)\,d\zeta + A\phi_1'(x) + B\phi_2'(x).$$

But from Eq. (5) we see that H(x,x) = 0 and hence

$$u'(a) = A\phi_1'(a) + B\phi_2'(a) = 0. \qquad (7)$$

From Eqs. (6) and (7) we conclude A = 0 = B since the determinant
of the coefficients is precisely the Wronskian evaluated at x = a,
which is not zero.
We can now state our conclusion in the form of a theorem.
We can now state our conclusion in the form of a theorem. 

THEOREM: Hypothesis. Let $Lu = p_0(x)u'' + p_1(x)u' + p_2(x)u$ be a linear
differential equation where the coefficients $p_0$, $p_1$, $p_2$ are continuous
functions in some closed finite interval [a,b]. Let $p_0 > 0$ in [a,b].
Let $\phi_1(x)$ and $\phi_2(x)$ be any two linearly independent solutions of Lu = 0
with Wronskian W(x). Define $H(x,\zeta)$ by Eq. (5).

Conclusion. If r(x) is any continuous function in [a,b] then the
function u(x),

$$u(x) = \int_a^x H(x,\zeta)r(\zeta)\,d\zeta$$

satisfies the non-homogeneous equation Lu = r(x) and the boundary
conditions

$$u(a) = 0, \quad u'(a) = 0.$$
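The theorem can be checked numerically on a simple case. The sketch below is an illustration only, under assumptions not taken from the paper: the operator is chosen as Lu = u'' + u on [0, b] with $\phi_1 = \cos$, $\phi_2 = \sin$ (so $p_0 = 1$ and W = 1), the forcing term is r = 1, and the integral is approximated by a composite Simpson rule. The names `simpson`, `H` and `u` are hypothetical helpers.

```python
import math

def simpson(f, lo, hi, n=200):
    """Composite Simpson rule on [lo, hi] with n (even) subintervals."""
    if hi == lo:
        return 0.0
    h = (hi - lo) / n
    total = f(lo) + f(hi)
    for k in range(1, n):
        total += (4 if k % 2 else 2) * f(lo + k * h)
    return total * h / 3

def H(x, z):
    # Eq. (5) with phi1 = cos, phi2 = sin, p0 = 1, W = 1:
    # H = -(cos x sin z - sin x cos z), which equals sin(x - z)
    return -(math.cos(x) * math.sin(z) - math.sin(x) * math.cos(z))

def u(x, r, a=0.0):
    # u(x) = integral from a to x of H(x, z) r(z) dz
    return simpson(lambda z: H(x, z) * r(z), a, x)

r = lambda z: 1.0
```

For this choice the integral can also be done by hand, giving u(x) = 1 - cos x, which satisfies u'' + u = 1 with u(0) = 0 and u'(0) = 0; the numerical value of `u(x, r)` should agree with it to within the quadrature error.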


This function $H(x,\zeta)$ is what we shall call a “one-sided Green’s
function”. The definition arises because we are considering a one-point
boundary value problem, that is, our boundary conditions u(a) and u'(a)
depend on only one point of the interval. In general, we are interested
in two-point boundary value problems, that is, where the boundary
conditions are given in the form

$$U_1(u) = A_1u(a) + A_2u'(a) + A_3u(b) + A_4u'(b)$$
$$U_2(u) = B_1u(a) + B_2u'(a) + B_3u(b) + B_4u'(b) \qquad (8)$$

and depend on both end points of the interval. Of course the one-point
boundary value problems are included as special cases of these.

It is our purpose in the next section to show how the classical
Green’s function can be obtained from this one-sided Green’s function.
Before doing this we shall generalize the above theorem to differential
equations of the nth order.

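THEOREM: Hypothesis. Let $Lu = p_0(x)u^{(n)} + p_1(x)u^{(n-1)} + \cdots + p_n(x)u$
be a linear differential equation with the $p_i(x) \in C^0$ in some closed
finite interval [a,b]. Let $p_0(x) > 0$ in [a,b]. Let $\{\phi_1(x), \phi_2(x),
\ldots, \phi_n(x)\}$ be n linearly independent solutions of Lu = 0 with Wronskian
W(x),

$$W(x) = \begin{vmatrix} \phi_1(x) & \phi_2(x) & \cdots & \phi_n(x) \\ \phi_1'(x) & \phi_2'(x) & \cdots & \phi_n'(x) \\ \vdots & \vdots & & \vdots \\ \phi_1^{(n-1)}(x) & \phi_2^{(n-1)}(x) & \cdots & \phi_n^{(n-1)}(x) \end{vmatrix}.$$

Define

$$H(x,\zeta) = \frac{(-1)^{n-1}}{p_0(\zeta)W(\zeta)} \begin{vmatrix} \phi_1(x) & \phi_2(x) & \cdots & \phi_n(x) \\ \phi_1(\zeta) & \phi_2(\zeta) & \cdots & \phi_n(\zeta) \\ \phi_1'(\zeta) & \phi_2'(\zeta) & \cdots & \phi_n'(\zeta) \\ \vdots & \vdots & & \vdots \\ \phi_1^{(n-2)}(\zeta) & \phi_2^{(n-2)}(\zeta) & \cdots & \phi_n^{(n-2)}(\zeta) \end{vmatrix}.$$

Conclusion. If r(x) is any continuous function in [a,b], then the
function u(x),

$$u(x) = \int_a^x H(x,\zeta)r(\zeta)\,d\zeta$$

satisfies the non-homogeneous equation Lu = r(x) and the boundary
conditions $u^{(\beta)}(a) = 0$, $\beta = 0, 1, 2, \ldots, n-1$.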
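As an illustration of the nth-order definition, the following sketch evaluates the determinant formula for H in one assumed case: Lu = u''' (so n = 3, $p_0 = 1$) with the solutions 1, t, t² of Lu = 0. These choices, and the helper names `det`, `derivs`, `wronskian` and `H`, are assumptions made for the example and do not come from the paper. Exact rational arithmetic is used so the result can be compared with the closed form.

```python
from fractions import Fraction

def det(m):
    """Determinant by Laplace expansion along the first row (fine for small n)."""
    if len(m) == 1:
        return m[0][0]
    total = 0
    for j, entry in enumerate(m[0]):
        minor = [row[:j] + row[j + 1:] for row in m[1:]]
        total += (-1) ** j * entry * det(minor)
    return total

# derivs[k](t) lists the k-th derivatives of phi = 1, t, t^2 at t
derivs = [
    lambda t: [1, t, t * t],
    lambda t: [0, 1, 2 * t],
    lambda t: [0, 0, 2],
]

def wronskian(t):
    return det([d(t) for d in derivs])

def H(x, z, p0=lambda t: 1):
    n = len(derivs)
    # first row: phi_i(x); remaining rows: phi_i(z), phi_i'(z), ..., phi_i^(n-2)(z)
    rows = [derivs[0](x)] + [derivs[k](z) for k in range(n - 1)]
    return Fraction((-1) ** (n - 1) * det(rows), p0(z) * wronskian(z))
```

Expanding the determinant by hand for this case gives $H(x,\zeta) = (x-\zeta)^2/2$, the familiar kernel for a threefold repeated integral, so `H(x, z)` should return that value for rational inputs.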
3. The Green’s Function
We have seen in the first theorem that $H(x,\zeta)$ has the property
that if

$$u(x) = \int_a^x H(x,\zeta)r(\zeta)\,d\zeta \qquad (9)$$

then Lu = r and u(a) = 0 = u'(a). Suppose now our boundary conditions
are given at the point b of the interval. That is, we wish to solve the
equation Lu = r subject to the boundary conditions u(b) = 0 = u'(b).
Clearly,

$$u(x) = -\int_x^b H(x,\zeta)r(\zeta)\,d\zeta \qquad (10)$$

has these properties (as can be verified by a direct calculation). Now,
let us combine these two results. Suppose we wish to solve Lu = r subject
to the two-point boundary conditions

$$u(a) + u(b) = 0$$
$$u'(a) + u'(b) = 0.$$


It is clear from Eqs. (9) and (10) that we want the Green’s function to
behave like $+\tfrac{1}{2}H(x,\zeta)$ in the neighborhood of x = a and like $-\tfrac{1}{2}H(x,\zeta)$
in the neighborhood of x = b. Or to be more precise, define

$$g(x,\zeta) = \tfrac{1}{2}H(x,\zeta) \quad \text{if } \zeta < x$$
$$g(x,\zeta) = -\tfrac{1}{2}H(x,\zeta) \quad \text{if } \zeta > x.$$

We assert that if

$$u(x) = \int_a^b g(x,\zeta)r(\zeta)\,d\zeta \qquad (11)$$

then Lu = r and u(a) + u(b) = 0, u'(a) + u'(b) = 0.
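To verify this assertion let us calculate Lu from Eq. (11).
[This time we must be careful since $g(x,\zeta)$ and/or its derivatives may
have discontinuities at $x = \zeta$.] We shall calculate u'(x) and u''(x).

$$u'(x) = \lim_{\substack{\zeta \to x \\ \zeta < x}} g(x,\zeta)r(\zeta) - \lim_{\substack{\zeta \to x \\ \zeta > x}} g(x,\zeta)r(\zeta) + \int_a^b \frac{\partial}{\partial x}g(x,\zeta)r(\zeta)\,d\zeta.$$

From the definition of $g(x,\zeta)$ we see that both the above limits are
zero. Now,

$$u''(x) = \left[\lim_{\substack{\zeta \to x \\ \zeta < x}} \frac{\partial}{\partial x}g(x,\zeta) - \lim_{\substack{\zeta \to x \\ \zeta > x}} \frac{\partial}{\partial x}g(x,\zeta)\right] r(x) + \int_a^b \frac{\partial^2}{\partial x^2}g(x,\zeta)r(\zeta)\,d\zeta.$$

Since

$$\frac{\partial}{\partial x}g(x,\zeta) = \frac{-1}{2p_0(\zeta)W(\zeta)} \begin{vmatrix} \phi_1'(x) & \phi_2'(x) \\ \phi_1(\zeta) & \phi_2(\zeta) \end{vmatrix}, \qquad \zeta < x,$$

we have

$$\lim_{\substack{\zeta \to x \\ \zeta < x}} \frac{\partial}{\partial x}g(x,\zeta) = \frac{+1}{2p_0(x)}.$$

Similarly,

$$\lim_{\substack{\zeta \to x \\ \zeta > x}} \frac{\partial}{\partial x}g(x,\zeta) = \frac{-1}{2p_0(x)}.$$

From which we conclude that

$$u''(x) = \frac{r(x)}{p_0(x)} + \int_a^b \frac{\partial^2}{\partial x^2}g(x,\zeta)r(\zeta)\,d\zeta$$

and hence indeed, Lu = r(x). That this u(x) satisfies the boundary
conditions is also readily verified.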


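$$u(x) = \int_a^b g(x,\zeta)r(\zeta)\,d\zeta = \int_a^x \tfrac{1}{2}H(x,\zeta)r(\zeta)\,d\zeta - \int_x^b \tfrac{1}{2}H(x,\zeta)r(\zeta)\,d\zeta,$$

so that

$$u(a) = -\tfrac{1}{2}\int_a^b H(a,\zeta)r(\zeta)\,d\zeta$$

and

$$u(b) = \tfrac{1}{2}\int_a^b H(b,\zeta)r(\zeta)\,d\zeta.$$

Hence u(a) + u(b) = 0. In a similar fashion we see that u'(a) + u'(b) = 0.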

The function $g(x,\zeta)$ we have constructed is a Green’s function in
the classical sense. Of course, we have constructed it only for a
particular set of two-point boundary conditions. Let us see how to form the
Green’s function $G(x,\zeta)$ for the same differential operator, namely,
$Lu = p_0u'' + p_1u' + p_2u$ and the most general two-point boundary conditions
as given by Eq. (8). From what has been done above, we see that

$$G(x,\zeta) = g(x,\zeta) + \psi_1(\zeta)\phi_1(x) + \psi_2(\zeta)\phi_2(x) \qquad (12)$$


has the property that if

$$u(x) = \int_a^b G(x,\zeta)r(\zeta)\,d\zeta,$$

then Lu = r(x) whatever the functions $\psi_1(\zeta)$ and $\psi_2(\zeta)$ may be.
We shall now determine $\psi_1$ and $\psi_2$ such that $U_1(G)$ and $U_2(G)$ are
zero. If we do this, then

$$U_i(u) = \int_a^b U_i(G(x,\zeta))\,r(\zeta)\,d\zeta = 0, \qquad i = 1, 2.$$

Hence this function u(x) satisfies the boundary conditions of Eq. (8).


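From Eq. (12),

$$U_i(G) = U_i(g) + \psi_1(\zeta)U_i(\phi_1) + \psi_2(\zeta)U_i(\phi_2), \qquad i = 1, 2.$$

If we equate these two equations to zero, we can solve for $\psi_1$ and $\psi_2$,
namely:

$$\psi_1(\zeta) = \frac{\begin{vmatrix} -U_1(g) & U_1(\phi_2) \\ -U_2(g) & U_2(\phi_2) \end{vmatrix}}{\begin{vmatrix} U_1(\phi_1) & U_1(\phi_2) \\ U_2(\phi_1) & U_2(\phi_2) \end{vmatrix}}, \qquad \psi_2(\zeta) = \frac{\begin{vmatrix} U_1(\phi_1) & -U_1(g) \\ U_2(\phi_1) & -U_2(g) \end{vmatrix}}{\begin{vmatrix} U_1(\phi_1) & U_1(\phi_2) \\ U_2(\phi_1) & U_2(\phi_2) \end{vmatrix}}.$$

[We place the restriction on the boundary conditions that

$$\begin{vmatrix} U_1(\phi_1) & U_1(\phi_2) \\ U_2(\phi_1) & U_2(\phi_2) \end{vmatrix} \neq 0.$$

This is equivalent to saying that the system Lu = 0, $U_1(u) = 0$, $U_2(u) = 0$
is incompatible or that the only solution of Lu = 0, $U_1(u) = 0$, $U_2(u) = 0$
is $u \equiv 0$.]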


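A compact form of writing Eq. (12) is the following:

$$G(x,\zeta) = \frac{1}{\Delta} \begin{vmatrix} g(x,\zeta) & \phi_1(x) & \phi_2(x) \\ U_1(g) & U_1(\phi_1) & U_1(\phi_2) \\ U_2(g) & U_2(\phi_1) & U_2(\phi_2) \end{vmatrix}, \qquad \Delta = \begin{vmatrix} U_1(\phi_1) & U_1(\phi_2) \\ U_2(\phi_1) & U_2(\phi_2) \end{vmatrix}.$$

Of course, $G(x,\zeta)$ can be generalized to nth order linear differential
systems.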
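The construction of Eq. (12) can be traced on a small case. The sketch below is only an illustration under assumed data that does not appear in the paper: Lu = u'' on [0, 1] with $\phi_1 = 1$, $\phi_2 = x$ (so $H(x,\zeta) = x - \zeta$), and the boundary operators $U_1(u) = u(0)$, $U_2(u) = u(1)$. The names `H`, `g`, `phi`, `U` and `G` are hypothetical.

```python
def H(x, z):
    # one-sided Green's function of Lu = u'' (phi1 = 1, phi2 = x, p0 = 1, W = 1)
    return x - z

def g(x, z):
    # +H/2 for z < x and -H/2 for z > x, as in the definition of g
    return 0.5 * H(x, z) if z < x else -0.5 * H(x, z)

phi = [lambda x: 1.0, lambda x: x]
a, b = 0.0, 1.0
# assumed boundary operators U1(u) = u(a), U2(u) = u(b); each acts on a function of x
U = [lambda f: f(a), lambda f: f(b)]

def G(x, z):
    U_g = [Ui(lambda t: g(t, z)) for Ui in U]
    U_phi = [[Ui(p) for p in phi] for Ui in U]  # U_phi[i][j] = U_i(phi_j)
    delta = U_phi[0][0] * U_phi[1][1] - U_phi[0][1] * U_phi[1][0]
    # Cramer's rule for psi1, psi2, following the two determinant quotients
    psi1 = (-U_g[0] * U_phi[1][1] + U_g[1] * U_phi[0][1]) / delta
    psi2 = (-U_phi[0][0] * U_g[1] + U_phi[1][0] * U_g[0]) / delta
    return g(x, z) + psi1 * phi[0](x) + psi2 * phi[1](x)
```

For these assumed boundary conditions the result can be compared with the well-known Green's function of u'' = r, u(0) = u(1) = 0, namely $x(\zeta - 1)$ for $x < \zeta$ and $\zeta(x - 1)$ for $x > \zeta$.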


New York University 

















SYSTEMS OF EQUATIONS, MATRICES 
AND DETERMINANTS 


Olga Taussky and John Todd 


INTRODUCTION 


The solution of systems of simultaneous linear equations is a problem 
which has been before mathematicians for centuries. It is known, for
instance, that the Babylonians studied systems of up to 10 equations.
The solution of large systems arose in connection with the adjustment 
of astronomical and geodetical observations and it was in this connection 
that Gauss made notable contributions. More recently, considerable 
attention has been paid to large systems which arise by the approximation 
to differential and integral equations and in studies in economics. 

The study of such systems is linked up with the theory of
determinants. Fourier, in the original investigations concerning the
coefficients in a trigonometrical series representing a periodic analytic
function, solved such systems (involving an arbitrary number of variables)
without the use of determinants. The first systematic account of the 
theory of determinants is due to Cauchy. An axiomatic presentation is 
due to Weierstrass. 

A most important aid to the study is the theory of matrices, which
was introduced by Cayley in connection with the theory of
transformations. Since then the theory of matrices has penetrated many branches
of mathematics. Although matrices do not obey the commutative law of
multiplication, they are of great service, sometimes, in fact, for this
very reason. They can be used to give representations of more abstract
mathematical objects such as permutations, groups, hypercomplex systems.
The theory therefore has many applications in such branches
of mathematics as modern algebra, geometry, number theory, and the theory
of automorphic functions.

An obvious generalization of matrix theory is from the finite to 
the infinite case. This has been largely due to Hilbert and has proved 
of use for instance in the theory of integral equations, in the theory 
of divergent series and in modern theories of physics. It is important
to note that the difference between the two theories is considerable, 
the finite case being algebraic, while the infinite is analytic. 

The present article is not intended in any way to be complete; 
it presents only certain topics which, in our opinion, are interesting, 
important or useful. It has been divided into two chapters which are
essentially self-contained: the first is theoretical and the second 
is practical. 
CHAPTER I


I. 1. DEFINITIONS, PRODUCTS AND SUMS, SPECIAL MATRICES, SIMILARITY 
TRANSFORMATIONS. 

A finite n×m matrix $A = (a_{ik})$ $[i = 1, \ldots, n;\ k = 1, \ldots, m]$ is
a set of nm real or complex numbers $a_{ik}$, usually arranged in a
rectangular scheme

$$\begin{pmatrix} a_{11} & \cdots & a_{1m} \\ \vdots & & \vdots \\ a_{n1} & \cdots & a_{nm} \end{pmatrix}.$$

If n = m the matrix is called square, otherwise it is called
rectangular. The elements $a_{ik}$ for fixed i form the i-th row, for fixed k
the k-th column of the matrix. If n = m the elements $a_{ii}$ form the
main diagonal or principal diagonal.

The definition of matrices is only complete after the product and 
sum of two matrices has been defined. The product of two matrices is 
defined only in the case where the number of columns of the first 
matrix is equal to the number of rows of the second. If $A = (a_{ik})$ is
an n×m matrix and $B = (b_{ik})$ is an m×p matrix then the product AB
is the n×p matrix $C = (c_{ik})$ where

$$c_{ik} = a_{i1}b_{1k} + \cdots + a_{im}b_{mk}.$$


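Thus the product of two n×n matrices is always defined. For example

$$\begin{pmatrix} 0 & 1 & 0 & 0 \\ -1 & 0 & 0 & 0 \\ 0 & 0 & 0 & -1 \\ 0 & 0 & 1 & 0 \end{pmatrix} \begin{pmatrix} 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \\ -1 & 0 & 0 & 0 \\ 0 & -1 & 0 & 0 \end{pmatrix} = \begin{pmatrix} 0 & 0 & 0 & 1 \\ 0 & 0 & -1 & 0 \\ 0 & 1 & 0 & 0 \\ -1 & 0 & 0 & 0 \end{pmatrix}.$$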

The product of an n×m matrix A and a column vector, i.e. an m×1
matrix, x, is defined and is written in the form Ax.

The product definition originates with the use of matrices in linear
homogeneous transformations of variables. Let $x_1, \ldots, x_n$ be n variables
and let the variables $y_1, \ldots, y_n$ be defined by the transformation

$$y_i = \sum_{k=1}^{n} b_{ik}x_k \quad \text{or} \quad y = Bx.$$

Let then $z_1, \ldots, z_n$ be another set of variables obtained from $y_1,
\ldots, y_n$ by the transformation

$$z_i = \sum_{k=1}^{n} a_{ik}y_k \quad \text{or} \quad z = Ay.$$

It is easy to see that

$$z = ABx$$

in the sense of the matrix multiplication just defined.

A matrix which consists of zeros only is called a zero matrix and
is frequently denoted by 0; a square matrix which has ones on the main
diagonal and zeros elsewhere is called a unit matrix and is usually
denoted by 1 or I, or by $I_n$ if the matrix has n rows. It is easy to
see that $I_nA = AI_n = A$ for all n×n matrices A. If all the elements
off the principal diagonal are zero the matrix is called a principal
diagonal matrix. If all the elements of a matrix below (above) the
principal diagonal are zeros the matrix is called a right or upper
(left or lower) triangular matrix.

Let A be an n×n matrix and let another n×n matrix B exist such
that AB = I. It then follows that BA = I and that B is unique. The
matrix B is called the inverse of A and is denoted by $A^{-1}$. In general,
however, $AB \neq BA$. If AB = BA the matrices A and B are said to commute
or to be commutative. Another important point in which matrices differ
from ordinary numbers is the fact that the product of two matrices
can be the zero matrix without either of the factors being a zero
matrix. It can even happen that the product of two matrices AB is
a zero matrix, while BA is not a zero matrix, e.g., for

$$A = \begin{pmatrix} 0 & 1 \\ 0 & 0 \end{pmatrix}, \qquad B = \begin{pmatrix} 1 & 0 \\ 0 & 0 \end{pmatrix}.$$

A matrix which has no inverse is called singular; it is non-singular
if it has an inverse.
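The remark that AB can vanish while BA does not is easy to confirm on a pair of 2×2 matrices. The pair used below is an assumed example chosen for the illustration (both matrices have a zero second row), and `matmul` is a hypothetical helper.

```python
def matmul(A, B):
    """Product of two matrices stored as lists of rows."""
    return [[sum(A[i][j] * B[j][k] for j in range(len(B)))
             for k in range(len(B[0]))]
            for i in range(len(A))]

A = [[0, 1], [0, 0]]
B = [[1, 0], [0, 0]]

AB = matmul(A, B)  # every entry is zero
BA = matmul(B, A)  # has a non-zero entry in the first row
```

Both factors here are singular, as they must be: if A had an inverse, AB = 0 would force B = 0.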
If $\alpha$ is a complex number and $A = (a_{ik})$ then $\alpha A$ is defined to be
$(\alpha a_{ik})$.
The sum of two matrices is defined only in the case where both
have the same number of rows and columns. In this case, the sum of
two n×m matrices $A = (a_{ik})$ and $B = (b_{ik})$ is defined as $C = (c_{ik})$ where

$$c_{ik} = a_{ik} + b_{ik}.$$

If 0 is the n×m zero matrix then 0 + A = A + 0 = A for all n×m matrices
A. It is easily seen that both sum and product are associative operations
and linked together by the distributive law whenever the product
A(B + C) is defined. The set of all n×n matrices forms a ring, a
so-called complete matrix algebra. This algebra or hypercomplex system
has $n^2$ base elements which can e.g. be chosen as the matrices with
all elements but one equal to 0. The non-singular n×n matrices form
a group under multiplication.

The matrix obtained by interchanging the rows and columns of a
matrix A is called the transpose of A, and is denoted by A' or $A^T$.
It is easily seen that (AB)' = B'A'. If the elements of A are complex
numbers then $\bar{A}$ denotes the matrix obtained from A by replacing each
element by its complex conjugate. The matrix $\bar{A}'$ is frequently denoted
by A*. A square matrix A for which A A* = A*A is called normal. A special
case of such matrices are the hermitian matrices for which A = A*.
A hermitian matrix with real elements is called symmetric. Another
class of normal matrices is the unitary matrices, defined by $A^{-1} = A^*$.
A real unitary matrix is called orthogonal.

An example of a normal matrix which is in general neither hermitian 
nor unitary is given by the incidence matrices, which occur in the 
study of finite projective planes. The elements of such a matrix are 
zero or one, with at least three ones in each row. Further, any two 
distinct rows (columns) have a one in common in exactly one column 
(row). 

If in the equation Ax = b (where x and b are column vectors and
A is a square matrix) the vector b is replaced by Sb where S is a
non-singular n×n matrix, then this new equation is equivalent to

$$S^{-1}AS(S^{-1}x) = b.$$

The matrix $S^{-1}AS$ is called similar to A or it is said to be obtained
by a similarity transformation of A by S or that $S^{-1}AS$ is the
transform of A by S. When a matrix A is transformed by another one certain
quantities are left invariant, whatever S may be. One of them is the
trace of A which is the sum of the elements on the principal diagonal.
Another one is the determinant.
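The invariance of trace and determinant under a similarity transformation can be seen on a 2×2 example. The matrices `A` and `S` below are arbitrary choices made for this sketch, the helper names are hypothetical, and exact fractions are used so that the comparison is not blurred by rounding.

```python
from fractions import Fraction

def matmul(A, B):
    """Product of two matrices stored as lists of rows."""
    return [[sum(A[i][j] * B[j][k] for j in range(len(B)))
             for k in range(len(B[0]))]
            for i in range(len(A))]

def inv2(S):
    """Inverse of a non-singular 2 x 2 matrix."""
    (a, b), (c, d) = S
    det = a * d - b * c
    if det == 0:
        raise ValueError("S is singular")
    return [[Fraction(d, det), Fraction(-b, det)],
            [Fraction(-c, det), Fraction(a, det)]]

def trace(A):
    return sum(A[i][i] for i in range(len(A)))

def det2(A):
    return A[0][0] * A[1][1] - A[0][1] * A[1][0]

A = [[1, 2], [3, 4]]
S = [[2, 1], [1, 1]]
T = matmul(matmul(inv2(S), A), S)  # the transform of A by S
```

The individual entries of `T` differ from those of `A`, but `trace(T)` and `det2(T)` agree with `trace(A)` and `det2(A)`.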


I. 2. DETERMINANTS 
The determinant of A, also denoted by |A| or $|a_{ik}|$, is the following
number associated with the n×n matrix A:

$$|A| = \sum \pm\, a_{1\mu_1}a_{2\mu_2}\cdots a_{n\mu_n}$$

where $\mu_1, \ldots, \mu_n$ runs through all permutations of the numbers 1, ..., n.
The + sign is given to the terms where the permutation is even, the
- sign when it is odd. The value of |A| is rarely computed from this
definition when n > 3. In order to compute it some of the following
properties of |A| are used:

(1) The value of |A| is unaltered if a multiple of a row (or column)
is added to another row (or column).

(2) If all the elements of a row (or column) are multiplied by a 
constant c then the determinant is multiplied by c. 


(3) |A| = |A'|

(4) If two rows or columns are interchanged the determinant is
multiplied by -1. This implies that the determinant of a matrix with two
identical rows or columns vanishes; this fact could also be deduced
from (1).

Conversely, it can be shown that any polynomial which is linear
and homogeneous in the elements of each row of a matrix and which
satisfies (4), and which reduces to 1 for unit matrices coincides
with the determinant.

(5) |A| = 0 if and only if the rows are linearly dependent, i.e. if
numbers $c_1, \ldots, c_n$ not all zero exist such that $\sum_{i=1}^{n} c_ia_{ik} = 0$,
k = 1, ..., n.

(6) A matrix is non-singular if and only if $|A| \neq 0$.

(7) Denote by $A_{ik}$ the determinant of the (n-1)×(n-1) matrix obtained
from A by omitting the i-th row and k-th column. Then

$$|A| = \sum_{k=1}^{n} (-1)^{i+k}a_{ik}A_{ik}, \qquad i = 1, \ldots, n,$$

$$0 = \sum_{k=1}^{n} (-1)^{j+k}a_{ik}A_{jk} \qquad (i \neq j).$$

The determinants $A_{ik}$ are also called minors of A. They can be used
to write down a formal inverse for a matrix $A = (a_{ik})$:

$$A^{-1} = \frac{1}{|A|}\left((-1)^{i+k}A_{ki}\right).$$


However since the computation of determinants is rather cumbersome 
this expression is rarely used for numerical applications, and other 
methods for finding the inverse of matrices are employed. This will 
be discussed in Chapter II. 
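For small matrices the expansion by minors and the formal inverse can nevertheless be written out directly. The sketch below is illustrative only: the helper names and the 3×3 test matrix are assumptions, and the recursive expansion used here grows factorially with n, which is the cumbersomeness referred to above.

```python
from fractions import Fraction

def minor(A, i, k):
    """Matrix obtained from A by omitting row i and column k (0-based)."""
    return [row[:k] + row[k + 1:] for r, row in enumerate(A) if r != i]

def det(A):
    """Determinant by expansion along the first row."""
    if len(A) == 1:
        return A[0][0]
    return sum((-1) ** k * A[0][k] * det(minor(A, 0, k)) for k in range(len(A)))

def inverse(A):
    d = det(A)
    if d == 0:
        raise ValueError("matrix is singular")
    n = len(A)
    # entry (i, k) of the inverse uses the minor A_ki: note the transposition
    return [[Fraction((-1) ** (i + k) * det(minor(A, k, i)), d)
             for k in range(n)] for i in range(n)]

def matmul(A, B):
    return [[sum(A[i][j] * B[j][k] for j in range(len(B)))
             for k in range(len(B[0]))]
            for i in range(len(A))]

A = [[2, 1, 0], [1, 3, 1], [0, 1, 2]]
A_inv = inverse(A)
```

Multiplying `A` by `A_inv` in either order should reproduce the unit matrix exactly, since the arithmetic is rational.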

More general types of minors are obtained if r $(1 \le r < n)$ rows and
columns are omitted from A. If the rows and columns omitted have the
same indices the minor is called a principal minor.

If A and B are two square matrices then

$$|AB| = |A|\,|B|.$$

This implies the statement made earlier that the determinant of $S^{-1}AS$
is the same as that of A.

The concept of rank applies to matrices which are not necessarily
square. Let A be any n×m matrix. Denote by $S_q$ the q×q sub-matrix
obtained by removing a certain n - q rows and a certain m - q columns
from A. Let r be the largest value of q for which non-singular $S_q$
exist. Then r is called the rank of A. E.g. if $a_{ik} = 0$ then rank
$(a_{ik}) = 0$ while if $a_{ik} = 1$, then rank $(a_{ik}) = 1$. If m = n and A is
non-singular it has rank n. It can be shown that the rank of A is
the maximum number of linearly independent rows (or columns) in A.
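In practice the rank is usually found from the second characterization, the number of linearly independent rows, rather than by searching over sub-matrices. The sketch below uses row elimination with exact fractions; this is a substituted technique chosen for the illustration, not the definition given above, and `rank` is a hypothetical helper name.

```python
from fractions import Fraction

def rank(A):
    """Rank of a rectangular matrix via row elimination over the rationals."""
    M = [[Fraction(x) for x in row] for row in A]
    rows, cols = len(M), len(M[0])
    r = 0  # number of pivots found so far
    for c in range(cols):
        pivot = next((i for i in range(r, rows) if M[i][c] != 0), None)
        if pivot is None:
            continue  # no pivot in this column
        M[r], M[pivot] = M[pivot], M[r]
        for i in range(r + 1, rows):
            factor = M[i][c] / M[r][c]
            M[i] = [x - factor * y for x, y in zip(M[i], M[r])]
        r += 1
        if r == rows:
            break
    return r
```

The two cases mentioned in the text serve as a check: a matrix of zeros has rank 0 and a matrix of ones has rank 1.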


I. 3. CHARACTERISTIC ROOTS, CANONICAL FORMS, COMMUTATIVE MATRICES, 
QUADRATIC FORMS. 

The fact that the trace and determinant of A are unaltered under
similarity transformation is only a consequence of the fact that the
eigenvalues or characteristic roots of A are unaltered under such a
transformation. These n numbers $\lambda_1, \ldots, \lambda_n$ are defined to be the
values $\lambda$ for which the matrix $(A - \lambda I_n)$ is singular, or alternatively
as the roots of the determinantal equation

$$|A - \lambda I| = 0,$$

the so-called characteristic equation of A. Conversely, every algebraic
equation $a_0x^n + a_1x^{n-1} + \cdots + a_n = 0$ $(a_0 \neq 0)$ can be interpreted as
the characteristic equation of an n×n matrix, e.g. of

The characteristic roots can also be defined as the numbers $\lambda$ for which

$$Ax = \lambda x$$

for suitable vectors $x \neq 0$, the so-called eigenvectors or characteristic
vectors (belonging to $\lambda$).


As an example consider the matrix $\begin{pmatrix} 1 & 2 \\ -2 & 1 \end{pmatrix}$. It has the characteristic
roots $1 \pm 2i$. The vector $x = (x_1, x_2)$ which corresponds to 1 + 2i
has $x_1 = 1$, $x_2 = i$; the vector which corresponds to 1 - 2i has $x_1 = -1$,
$x_2 = i$.

If S is any non-singular n×n matrix then $S^{-1}(A - \lambda I)S = S^{-1}AS
- \lambda I$; hence $|S^{-1}AS - \lambda I| = |A - \lambda I|$. This shows that a transformed
matrix has the same characteristic equation as the original one. The
trace of the matrix is the coefficient of $(-\lambda)^{n-1}$ in the characteristic
equation, the determinant the constant term. The traces of matrices
are used very intensively in the representation of groups by matrices.
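For a 2×2 matrix the characteristic equation reads $\lambda^2 - (\text{trace})\lambda + \text{determinant} = 0$, so the roots follow from the quadratic formula. The sketch below is an illustration on an assumed matrix with complex roots; `char_roots_2x2` and `matvec` are hypothetical helpers.

```python
import cmath

def char_roots_2x2(A):
    """Roots of lambda^2 - tr(A) lambda + det(A) = 0 for a 2 x 2 matrix."""
    tr = A[0][0] + A[1][1]
    det = A[0][0] * A[1][1] - A[0][1] * A[1][0]
    disc = cmath.sqrt(tr * tr - 4 * det)
    return (tr + disc) / 2, (tr - disc) / 2

def matvec(A, x):
    return [sum(a * v for a, v in zip(row, x)) for row in A]

A = [[1, 2], [-2, 1]]
roots = char_roots_2x2(A)

# candidate characteristic vector for the root 1 + 2i
x = [1, 1j]
Ax = matvec(A, x)
```

For this matrix the trace is 2 and the determinant is 5, so the roots are 1 ± 2i, and `Ax` should equal (1 + 2i) times `x`, component by component.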

The characteristic roots need not all be different. If a root is
simple then the corresponding vector is uniquely determined apart from
a common multiplier of its components. If a root is of multiplicity
r > 1 then it can have more than one corresponding characteristic vector:
e.g. the matrix $\begin{pmatrix} 2 & 0 \\ 0 & 2 \end{pmatrix}$ has the double root $\lambda = 2$, while any vector
$(x_1, x_2)$ can be taken as corresponding characteristic vector; the
matrix $\begin{pmatrix} 2 & 1 \\ 0 & 2 \end{pmatrix}$ also has the double root $\lambda = 2$, but only the vector
$x_1 = 1$, $x_2 = 0$ corresponds to it.
There are other important quantities associated with a matrix which
are not, however, invariant under all similarity transformations. One
of them is the norm of A, $\left(\sum_{i,k=1}^{n} |a_{ik}|^2\right)^{1/2}$. This quantity is however
invariant under all transformations by unitary matrices. The squared
norm of A is the trace of A A*.

The matrix A A* is hermitian. Such matrices have real characteristic 
roots; in particular real symmetric matrices have this property. Normal 
matrices do not, in general, have real characteristic roots; in fact 
a normal matrix with only real characteristic roots is hermitian. The 
matrix A A* has further all its characteristic roots non-negative. 
A hermitian matrix with only positive (non-negative) characteristic 
roots is called positive definite (positive semidefinite). Analogous 
definitions hold when the characteristic roots are negative
(non-positive). This notation originates from the link between hermitian
matrices and forms described later. 

It follows from the definition of the characteristic equation that 
the matrices A and A’ have the same characteristic roots, while the 
characteristic roots of A* and A are conjugate complex numbers. Although 
the characteristic roots of A A* are real and non-negative they are 
in general not the squared absolute values of the characteristic roots 
of A. 

In general a product of two n×n matrices A, B does not have as
characteristic roots the products of the characteristic roots of A and
those of B for some ordering of the roots. It was, however, proved
by Frobenius that commutative matrices have this property. Commutativity,
however, is not a necessary condition. Pairs of matrices A, B which
have the property that any polynomial p(A,B) has as characteristic
roots $p(\lambda_i, \mu_i)$ for some ordering of the roots $\lambda_i$ of A and of the roots
$\mu_i$ of B are called pairs of matrices with property P. They include
all pairs of commutative matrices and depend on properties of the
commutator AB - BA.

The characteristic roots of AB and BA always coincide. If, e.g. A
is non-singular this follows easily from the fact that $BA = A^{-1}(AB)A$.
It follows that the characteristic roots of a product $A_1 \cdots A_r$ are
unaltered if the factors are permuted cyclically. This shows again
that $S^{-1}AS$ and A have the same characteristic roots.
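The coincidence of the characteristic roots of AB and BA holds even when neither factor can be inverted to give a similarity. A small check, on an assumed pair in which A is singular: for 2×2 matrices the characteristic equation is fixed by the trace and the determinant, so it is enough to compare those. The helper names are hypothetical.

```python
def matmul(A, B):
    """Product of two matrices stored as lists of rows."""
    return [[sum(A[i][j] * B[j][k] for j in range(len(B)))
             for k in range(len(B[0]))]
            for i in range(len(A))]

def char_coeffs_2x2(M):
    """(trace, determinant): the coefficients of the 2 x 2 characteristic equation."""
    return (M[0][0] + M[1][1], M[0][0] * M[1][1] - M[0][1] * M[1][0])

A = [[1, 2], [0, 0]]   # singular, so BA = A^{-1}(AB)A is not available
B = [[3, 1], [4, 2]]

AB = matmul(A, B)
BA = matmul(B, A)
```

Here `AB` and `BA` are different matrices, yet `char_coeffs_2x2` returns the same pair for both, so their characteristic roots coincide.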

The similarity transform of a symmetric (hermitian) matrix is in
general not symmetric (hermitian); however, the transform by means
of an orthogonal (unitary) matrix is. It is further known that every
hermitian matrix can be transformed by a unitary similarity into
diagonal form i.e. into the form

$$\begin{pmatrix} \lambda_1 & & \\ & \ddots & \\ & & \lambda_n \end{pmatrix}$$

with zeros off the principal diagonal. This is true whether the $\lambda_i$'s
are simple or not. The same is true for normal matrices (only the
$\lambda_i$ are then not all real), and this fact constitutes an alternative
definition for matrices to be normal.

A general matrix cannot always be transformed by a similarity into
diagonal form, e.g. $\begin{pmatrix} 1 & 1 \\ 0 & 1 \end{pmatrix}$. If all the characteristic roots are simple
then it is always possible. If there is a multiple root of multiplicity
r (>1), but with fewer than r independent corresponding characteristic
vectors then transformation to diagonal form is not possible by means
of a similarity. If there are multiple roots then the matrix can in
general only be transformed into triangular form, and this is always
possible by a unitary transformation. A particular triangular form
very frequently used is the Jordan normal form.

A pair of matrices which has property P can be transformed to triangular
form simultaneously by the same similarity transformation. A pair of
matrices which can be transformed to diagonal form simultaneously
by the same similarity transformation commutes.


Real symmetric matrices $A = (a_{ik})$ are closely connected with quadratic
forms $\sum_{i,k=1}^{n} a_{ik}x_ix_k$ (or x'Ax). The fact that every quadratic form
can be transformed into a sum of squares by a linear transformation
of the variables corresponds to the fact that the matrix A can be
transformed to main diagonal form by a transformation of the type
S'AS. If the transforming matrix S is orthogonal so that $S' = S^{-1}$
then we obtain the reduction to principal axes for the quadratic form.
Similar results hold for hermitian matrices and their associated form
$\sum_{i,k=1}^{n} a_{ik}\bar{x}_ix_k$. Skew symmetric matrices are real matrices $(a_{ik})$ with
$a_{ik} = -a_{ki}$. These matrices have as characteristic roots pairs of
conjugate purely imaginary numbers or zero. If they are non-singular
(this is only possible for even n) they can be transformed by
orthogonal matrices to the form

$$\begin{pmatrix} 0 & a_1 & & & \\ -a_1 & 0 & & & \\ & & \ddots & & \\ & & & 0 & a_{n/2} \\ & & & -a_{n/2} & 0 \end{pmatrix}.$$


Any real matrix can be expressed as the sum of a symmetric and a
skew-symmetric matrix. Further, any matrix can be expressed in the
form $H_1 + iH_2$ where $H_1$, $H_2$ are hermitian.

Symmetric non-singular matrices have symmetric inverses; their
products, however, are in general not symmetric. In fact every matrix
can be expressed as the product of two not necessarily real symmetric
matrices. The product of two symmetric matrices will be symmetric
only when the matrices commute.


I. 4. BOUNDS FOR CHARACTERISTIC ROOTS, FIELDS OF VALUES, POSITIVE 
MATRICES 


For many purposes it is not necessary to know the exact values of
the characteristic roots, and certain estimates will suffice. Some
of these estimates arise from considering the matrix A A* whose
characteristic roots are non-negative. It can be shown that each $|\lambda_i|^2$
lies between the largest and the smallest characteristic roots of
A A*. Further, $\sum_{i=1}^{n} |\lambda_i|^2$ does not exceed the squared norm of A. Equality
between these two sums holds if and only if A is normal. It is further
known that $|\lambda_i| \le \max_i r_i$, where $r_i$ is the sum of the absolute values
of the elements of the i-th row of A and an analogous theorem holds
for the columns.
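These estimates can be tried on 2×2 matrices, where the characteristic roots are available from the quadratic formula. The two matrices below are assumed examples chosen for the sketch, one normal and one not, and the helper names are hypothetical.

```python
import cmath

def char_roots_2x2(A):
    """Roots of lambda^2 - tr(A) lambda + det(A) = 0 for a 2 x 2 matrix."""
    tr = A[0][0] + A[1][1]
    det = A[0][0] * A[1][1] - A[0][1] * A[1][0]
    disc = cmath.sqrt(tr * tr - 4 * det)
    return (tr + disc) / 2, (tr - disc) / 2

def max_row_sum(A):
    """Largest sum of absolute values taken over the rows of A."""
    return max(sum(abs(v) for v in row) for row in A)

def squared_norm(A):
    """Sum of |a_ik|^2 over all elements of A."""
    return sum(abs(v) ** 2 for row in A for v in row)

def sum_sq_roots(A):
    return sum(abs(lam) ** 2 for lam in char_roots_2x2(A))

normal = [[1, 2], [-2, 1]]     # commutes with its transpose
non_normal = [[1, 1], [0, 1]]  # does not
```

For the normal matrix the two sums agree (both are 10), while for the second matrix the sum over the roots is strictly smaller than the squared norm; in both cases every root is bounded in absolute value by the largest row sum.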

A great amount of information concerning the position of the
characteristic roots is obtained from considering the field of values of
the matrix. This is the set of complex numbers $\sum_{i,k=1}^{n} a_{ik}\bar{x}_ix_k$ where
$x_1, \ldots, x_n$ is any set of numbers with $\sum_{i=1}^{n} \bar{x}_ix_i = 1$. The field of
values is a closed convex and connected set which contains the
characteristic roots inside or on the boundary. If A is normal then the
set coincides with the smallest convex polygon which includes all the
eigenvalues. If A is hermitian or real symmetric the field of values
is a segment on the real axis and since A is then normal its end points
are characteristic roots. This gives the well known fact that for a
symmetric matrix the largest characteristic root is max $\sum_{i,k=1}^{n} a_{ik}\bar{x}_ix_k$,
where $\sum_{i=1}^{n} \bar{x}_ix_i = 1$, and the smallest characteristic root is min
$\sum_{i,k=1}^{n} a_{ik}\bar{x}_ix_k$.

If $A$ is a matrix with positive elements only (a so-called positive
matrix) then its characteristic root of maximum absolute value, the
so-called dominant root, is real and positive and simple; its
characteristic vector can be chosen to have all its components positive.
Furthermore, only the dominant root can have such a vector.

If $A$ is an arbitrary matrix $(a_{ik})$ and $B$ a matrix with positive
elements $b_{ik}$ such that $|a_{ik}| \le b_{ik}$ then any circle with
center at 0 which includes all characteristic roots of $B$ also includes
the characteristic roots of $A$.

Of particular interest are the matrices which have all their charac- 
teristic roots inside the unit circle. These matrices are important 
in the study of the convergence of iterative processes. 

Let us return to discussing bounds for roots of general matrices. 
An important class of bounds is obtained from the following theorem 
on determinants with dominant principal diagonals. 


If $|a_{ii}| > \sum_{k \ne i} |a_{ik}|$, $i = 1, \ldots, n$, then
$|A| \ne 0$. This theorem can be applied to the study of bounds for
characteristic roots of matrices. It is applied to the characteristic
determinant $|a_{ik} - \lambda \delta_{ik}|$, where $\delta_{ik} = 1$
for $i = k$ and 0 for $i \ne k$. It follows that this determinant
can only vanish, i.e. $\lambda$ can only be a characteristic root of
$(a_{ik})$, if $\lambda$ lies inside or on the boundary of one of the
n circles with centers $a_{ii}$ and radius $\sum_{k \ne i} |a_{ik}|$.
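The circle theorem above can be tried on a small case. The following is only a sketch: the function names and the 2 x 2 test matrix are illustrative choices, not taken from the article. It lists the n circles (center $a_{ii}$, radius equal to the sum of the absolute values of the other elements in row i) and confirms that each characteristic root lies in at least one of them.

```python
import cmath


def gershgorin_discs(a):
    """Return (center, radius) for each row of the square matrix a."""
    discs = []
    for i, row in enumerate(a):
        # radius = sum of absolute values of the off-diagonal elements of row i
        radius = sum(abs(v) for k, v in enumerate(row) if k != i)
        discs.append((row[i], radius))
    return discs


def roots_2x2(a):
    """Characteristic roots of a 2x2 matrix from x^2 - (trace)x + det = 0."""
    trace = a[0][0] + a[1][1]
    det = a[0][0] * a[1][1] - a[0][1] * a[1][0]
    s = cmath.sqrt(trace * trace - 4 * det)
    return [(trace + s) / 2, (trace - s) / 2]


def all_roots_covered(a):
    """True if every characteristic root lies in some circle of the theorem."""
    discs = gershgorin_discs(a)
    return all(
        any(abs(lam - c) <= r + 1e-12 for c, r in discs) for lam in roots_2x2(a)
    )


A = [[4.0, 1.0], [2.0, -3.0]]  # illustrative matrix, an assumption of this sketch
print(gershgorin_discs(A), roots_2x2(A), all_roots_covered(A))
```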


If the elements are all real and the principal diagonal elements
positive then it can even be shown that the determinant of a matrix
with dominant principal diagonal is positive, not merely different
from zero. Moreover, the characteristic roots of such a matrix have
positive real parts. If none of the non-diagonal elements of such
a matrix is positive then the characteristic root with smallest real
part is real. The latter results follow from the facts that a matrix
with positive elements has a real dominant root, and that the given
matrix can be expressed in the form $A = \alpha I - P$ where $\alpha$
is a positive number and $P$ a positive matrix.

Matrices with non-negative principal diagonal and non-positive 
off diagonal elements have been studied extensively. 


I. 5. FUNCTIONS OF MATRICES, COMMUTATIVE MATRICES, LIMITS. 


The characteristic roots play an important role when functions of
matrices are studied. If $A$ is an $n \times n$ matrix then any polynomial
in $A$ can be expressed as a polynomial in $A$ of degree less than n. The
matrix $A$ itself satisfies an equation of degree n, namely its own
characteristic equation when the constant term c is replaced by $cI$, and
zero by the zero matrix. This constitutes the so-called Cayley-Hamilton
theorem. The matrix can, however, satisfy an equation of degree less
than n, and the equation of smallest possible degree is called the
minimum equation of the matrix; e.g. the matrix
$\begin{pmatrix} 2 & 0 \\ 0 & 2 \end{pmatrix}$ has the characteristic
equation $x^2 - 4x + 4 = 0$ and the minimum equation $x - 2 = 0$;
the matrix $\begin{pmatrix} 2 & 1 \\ 0 & 2 \end{pmatrix}$ has
characteristic and minimum equation both equal to $x^2 - 4x + 4 = 0$.
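The distinction between characteristic and minimum equation can be checked by direct substitution. The sketch below is an illustration only: the helper names are hypothetical, and the two test matrices (twice the unit matrix, and an upper triangular matrix with the double root 2) are assumed examples having characteristic equation $x^2 - 4x + 4 = 0$. Both satisfy that equation, but only the first satisfies $x - 2 = 0$.

```python
def matmul(x, y):
    """Product of two square matrices given as lists of rows."""
    n = len(x)
    return [[sum(x[i][k] * y[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def poly_at(coeffs, a):
    """Evaluate c0*I + c1*A + c2*A^2 + ... at the matrix a (Horner's rule)."""
    n = len(a)
    result = [[0] * n for _ in range(n)]
    for c in reversed(coeffs):
        result = matmul(result, a)
        for i in range(n):
            result[i][i] += c  # the constant term c is replaced by c*I
    return result


ZERO = [[0, 0], [0, 0]]
scalar = [[2, 0], [0, 2]]       # assumed example: minimum equation x - 2 = 0
triangular = [[2, 1], [0, 2]]   # assumed example: minimum equation of degree 2

print(poly_at([4, -4, 1], scalar), poly_at([-2, 1], scalar))
print(poly_at([4, -4, 1], triangular), poly_at([-2, 1], triangular))
```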

A rational function in $A$ with non-singular denominator can again
be expressed as a polynomial. Any polynomial in $A$ commutes with $A$;
a partial converse of this is the fact that any matrix which commutes
with a matrix $A$ which has only simple characteristic roots is a
polynomial in $A$. Even if $A$ and $B$ commute it is not always possible
to find a matrix $C$ such that $A$ and $B$ are both polynomials in $C$.

The characteristic roots of a polynomial $p(A)$ in $A$ are the numbers
$p(\lambda_i)$ where $\lambda_i$ are the characteristic roots of $A$.
If $A$ has all its characteristic roots inside the unit circle then
$\lim_{n \to \infty} A^n = 0$, in the sense that if
$A^n = (a_{ij}^{(n)})$ then $\lim_{n \to \infty} a_{ij}^{(n)} = 0$ for
all i, j. Matrices with this property play an important role in the
study of power series in $A$. It can, e.g., be shown that the inverse of
$I - A$ can be expressed as the convergent power series
$\sum_{i=0}^{\infty} A^i$ if and only if $A$ has all its roots inside
the unit circle. From a remark made earlier it follows that matrices of
positive elements with row sums (or column sums) all less than 1 have
all their characteristic roots inside the unit circle. These matrices
are frequently used in probability theory.
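As a numerical illustration of the power series for the inverse of $I - A$, the sketch below sums $I + A + A^2 + \cdots$ for a matrix of positive elements whose row sums are less than 1 and multiplies the result by $I - A$. The matrix and the function names are assumptions made for this example only.

```python
def matmul(x, y):
    """Product of two square matrices given as lists of rows."""
    n = len(x)
    return [[sum(x[i][k] * y[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def power_series_sum(a, terms):
    """Return I + A + A^2 + ... + A^terms."""
    n = len(a)
    identity = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
    total = [row[:] for row in identity]
    power = [row[:] for row in identity]
    for _ in range(terms):
        power = matmul(power, a)  # next power of A
        total = [[total[i][j] + power[i][j] for j in range(n)] for i in range(n)]
    return total


A = [[0.2, 0.3], [0.4, 0.1]]  # positive elements, row sums 0.5 (assumed example)
I_minus_A = [[1 - A[0][0], -A[0][1]], [-A[1][0], 1 - A[1][1]]]
S = power_series_sum(A, 60)
product = matmul(I_minus_A, S)  # should be very close to the unit matrix
print(product)
```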
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We mentioned earlier that the set of all nXn matrices form a 
hypercomplex system with $n^2$ base elements. Conversely every hypercomplex
system with n base elements can be "represented" by a subset of the
set of $n \times n$ matrices. If the coefficients in the hypercomplex
system are real numbers the coefficients in the representing matrix may be
taken real too. By representation is meant a 1-1 correspondence which
preserves addition and multiplication. Very well known examples of
such representations are the $2 \times 2$ matrices

$$\begin{pmatrix} a & b \\ -b & a \end{pmatrix}$$

which represent the complex numbers $a + bi$ and the $4 \times 4$ matrices

$$\begin{pmatrix} a & b & c & d \\ -b & a & -d & c \\ -c & d & a & -b \\ -d & -c & b & a \end{pmatrix}$$

which represent the quaternions $a + bi + cj + dk$. There is a simple
method of obtaining a representation by matrices for a hypercomplex
system, the so-called regular representation. Let $e_1, \ldots, e_n$ be
the base elements and $a_1 e_1 + \cdots + a_n e_n$ an arbitrary element
of the system. The matrices $(a_{ik})$ defined by the relations
$e_i (a_1 e_1 + \cdots + a_n e_n) = \sum_k a_{ik} e_k$ form the so-called
regular representation.





(To be concluded in the Nov.-Dec. issue) 
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INFINITE SERIES AND TAYLOR AND 
FOURIER EXPANSIONS 
(Concluded)
Robert C. James 


Before using series in the solution of such problems, it is essential 
to know something about the nature of the sum function and to know 
under what conditions term by term differentiation and integration 
of an infinite series of variable terms is permissible. One of the most 
useful concepts to use in discussing these questions is uniform
convergence. A series $u_1(x) + u_2(x) + \cdots$ converges to $S(x_0)$
for $x = x_0$ if for any positive number $\epsilon$ the error in
approximating $S(x_0)$ by $\sum_{k=1}^{n} u_k(x_0)$ is less than
$\epsilon$ for all n sufficiently large. If it is possible to find an N
such that the error in approximating $S(x)$ by $\sum_{k=1}^{n} u_k(x)$
is small for all x in some region or interval I so long as n > N, the
series is said to converge uniformly. More precisely:

A series $\sum_{n=1}^{\infty} u_n(x)$ is uniformly convergent on a set I
if it is convergent for each x in I and, for any $\epsilon > 0$, there
is an N (not dependent on x) such that
$|S(x) - \sum_{k=1}^{n} u_k(x)| < \epsilon$ if n > N and x is a point
of I, where $S(x)$ is the sum of $\sum_{n=1}^{\infty} u_n(x)$.


Uniform convergence essentially requires that the remainder after
n terms of the series can be made small for all x in the set I simply by
taking n large enough. If there is a convergent series of non-negative
constants $\sum M_n$ for which $|u_n(x)| \le M_n$ for each x in I and
each n, then the remainder of the variable series can be made small by
taking n large enough to make the remainder of the series of constants
small, and this can thus be done independently of x. This test for
uniform convergence is usually called the Weierstrass M-test.
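The way the M-test controls the remainder independently of x can be seen numerically. The sketch below uses the series $\sum \sin(nx)/n^2$ with $M_n = 1/n^2$; this series, the sample points, and the truncation at 2000 terms (standing in for the infinite sum) are all assumptions chosen for illustration.

```python
import math


def partial_sum(x, n_terms):
    """Sum of the first n_terms terms of sin(nx)/n^2."""
    return sum(math.sin(n * x) / n**2 for n in range(1, n_terms + 1))


def constants_tail(n_terms, cutoff):
    """Sum of M_n = 1/n^2 for n_terms < n <= cutoff."""
    return sum(1.0 / n**2 for n in range(n_terms + 1, cutoff + 1))


N, CUTOFF = 50, 2000
xs = [k * 0.1 for k in range(-60, 61)]  # sample points of the set I
# largest remainder (terms N+1 .. CUTOFF) found over the sample points
worst = max(abs(partial_sum(x, CUTOFF) - partial_sum(x, N)) for x in xs)
bound = constants_tail(N, CUTOFF)  # the same bound serves every x
print(worst, bound)
```

The single number `bound` dominates the remainder at every sample point, which is the content of the test.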

If the terms of a series are all continuous at $x = x_0$, the sum of
the first n terms is also continuous at $x_0$. But if the remainder is
small for all x near $x_0$, then $S(x) - S(x_0)$ can not be much larger
than the difference between the values of the sum of the first n terms at
x and at $x_0$. This intuitive argument can be made rigorous to prove
that a series of continuous functions which converges uniformly has a
continuous sum.
n 


Similarly, if the sum $S(x)$ is very nearly equal to
$\sum_{k=1}^{n} u_k(x)$ at all points of an interval (a,b), then
$\int_a^b S(x)\,dx$ must be very nearly equal to
$\int_a^b [\sum_{k=1}^{n} u_k(x)]\,dx$, which is equal to
$\sum_{k=1}^{n} \int_a^b u_k(x)\,dx$. This type
of argument can be used to prove that a uniformly convergent series of
continuous functions can be integrated term by term. It follows from
this that a convergent series may be differentiated term by term if
the derived series converges uniformly and each derivative is continuous.
@ 


3. Power series. A series of type $\sum_{n=0}^{\infty} c_n (x - a)^n$,
where each $c_n$ is a constant, is called a power series. By use of the
ratio test, it can be shown that this series converges absolutely for
$|x - a| < r$ if it converges for one value of x satisfying
$|x - a| = r$. If it diverges for one value of x satisfying
$|x - a| = r$, it diverges if $|x - a| > r$.
Thus there is a constant R, called the radius of convergence, such
that the series converges if $|x - a| < R$ and diverges if $|x - a| > R$.
Use of the ratio test also shows that the series derived by
differentiating each term of the series, or by integrating each term, has
the same radius of convergence. It follows from the Weierstrass M-test
(with $M_n = |c_n| r_1^n$) that if $r_1 < R$, then the series converges
uniformly on the interval of all x satisfying $|x - a| \le r_1$. Thus the
terms of a power series may be differentiated or integrated within the
interval of convergence, the result being the derivative or integral of
the sum. As an example of this, consider the series
$1 - x^2 + x^4 - x^6 + \cdots$.
This is a geometric series which has the sum $1/(1 + x^2)$ if $|x| < 1$.
By integrating the sum and the series between 0 and $1/\sqrt{3}$, one gets
$\pi/6 = (1/\sqrt{3})(1 - 1/(3 \cdot 3) + 1/(5 \cdot 3^2) - 1/(7 \cdot 3^3) + 1/(9 \cdot 3^4) - \cdots)$.
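The series obtained by integrating $1/(1 + x^2)$ term by term from 0 to $1/\sqrt{3}$ converges quickly, as a short computation suggests. The function name below is hypothetical; the general term $(-1)^n / ((2n+1)\,3^n)$ is read off from the terms of the expansion of $\pi/6$.

```python
import math


def pi_over_six(terms):
    """Partial sum of (1/sqrt 3) * sum of (-1)^n / ((2n + 1) * 3^n)."""
    total = sum((-1) ** n / ((2 * n + 1) * 3**n) for n in range(terms))
    return total / math.sqrt(3)


for terms in (2, 5, 10, 30):
    # print the partial sum and its distance from pi/6
    print(terms, pi_over_six(terms), abs(pi_over_six(terms) - math.pi / 6))
```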

4. Taylor’s series. The principal uses of series of variable 
terms arise from the possibility of representing rather general func- 
tions as the sum of infinite series. Two types of series for which 
this is possible are power series and series of “orthogonal functions”, 
both of which will be discussed briefly. 

If one assumes that a given function f(x) has a power series
representation $c_0 + c_1(x - a) + c_2(x - a)^2 + \cdots$ which converges
to f(x) for all x near x = a, the coefficients can be easily evaluated.
Letting x = a gives $c_0 = f(a)$. Differentiating f(x) and the series and
letting x = a gives $c_1 = f'(a)$. Differentiating f(x) and the series n
times and letting x = a gives $c_n = f^{(n)}(a)/n!$, where $f^{(n)}(a)$
is the nth derivative of f(x) evaluated at x = a. With these values for
the coefficients $\{c_n\}$, the power series is called a Taylor’s series
for the function f(x). Unfortunately there are functions which are not
the sum of their Taylor’s series, even though the series are convergent.
It is therefore essential to have some method for determining whether
the sum of the series is the desired function, as well as whether the
series converges. The fundamental test of this is whether the remainder
$f(x) - \sum_{k=0}^{n-1} c_k (x - a)^k = R_n(x)$ approaches zero as n
becomes large.

If f(x) has an nth derivative in the closed interval (a,x), then it
can be shown that there is a number $\xi$ between a and x for which
$R_n(x) = [(x - a)^n/n!]\, f^{(n)}(\xi)$. There are other forms for
$R_n(x)$, e.g.
$R_n(x) = [(x - a)(x - \xi)^{n-1}/(n - 1)!]\, f^{(n)}(\xi)$. In each
case, $\xi$ is a number between a and x, but possibly a different number
in different forms of the remainder. If n = 1, the mean value theorem of
differential calculus can be obtained by use of either of these forms of
the remainder. In other words, when n = 1 one obtains
$f(x) = f(a) + (x - a) f'(\xi)$ for some number $\xi$ between a and x.
In general, $R_n(x)$ is the error which results from approximating f(x)
by the polynomial of degree n - 1 which has the same value at x = a and
whose first n - 1 derivatives have the same values at x = a as the
corresponding derivatives of f(x). The definition of convergence of an
infinite series applied to this situation states that the series
$\sum_{n=0}^{\infty} [(x - a)^n/n!]\, f^{(n)}(a)$ converges and has the
sum f(x) only for those x for which $\lim_{n \to \infty} R_n(x) = 0$.


For example, the Taylor’s series for $e^x$ with a = 0 is
$1 + x + x^2/2! + x^3/3! + x^4/4! + \cdots$ with
$R_n(x) = [x^n/n!]\, e^{\xi}$. The value of $\xi$ in $R_n(x)$ is unknown,
except for lying between 0 and x. If x > 0, $|R_n(x)| < e^x |x^n/n!|$,
and if x < 0, $|R_n(x)| < |x^n/n!|$. Since
$\lim_{n \to \infty} x^n/n! = 0$ for any value of x, it follows that
$\lim_{n \to \infty} R_n(x) = 0$ and that the Taylor series for $e^x$
converges for all x and the sum of the series is $e^x$.


Such an integral as $\int_0^1 e^{x^2}\,dx$ can be evaluated by means of
the Taylor’s series for $e^x$ as
$\int_0^1 (1 + x^2 + x^4/2! + x^6/3! + \cdots)\,dx = 1 + 1/3 + 1/(5 \cdot 2!) + 1/(7 \cdot 3!) + \cdots$.
The error in approximating the integral by the first four terms of this
series is $\int_0^1 R_4(x^2)\,dx = \int_0^1 [x^8/4!]\, e^{\xi}\,dx$,
where $0 < \xi < x^2$ for each x. Since $x^2 \le 1$ in the range of
integration, the error is less than
$e \int_0^1 (x^8/4!)\,dx = e/(9 \cdot 4!)$.
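The error estimate $e/(9 \cdot 4!)$ for the four-term approximation to $\int_0^1 e^{x^2}\,dx$ can be compared with the actual error. In the sketch below the integral is approximated by Simpson's rule, which is an assumption of this illustration (the article uses only the series); the function names are hypothetical.

```python
import math


def series_estimate(terms):
    """Sum of the integrals over (0,1) of x^(2k)/k!, i.e. 1/((2k+1) k!)."""
    return sum(1.0 / ((2 * k + 1) * math.factorial(k)) for k in range(terms))


def simpson(f, a, b, n=1000):
    """Composite Simpson's rule with an even number n of subintervals."""
    h = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(a + i * h)
    return total * h / 3


reference = simpson(lambda x: math.exp(x * x), 0.0, 1.0)
error = reference - series_estimate(4)   # error of the first four terms
bound = math.e / (9 * math.factorial(4))  # the estimate e/(9*4!)
print(series_estimate(4), reference, error, bound)
```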


The Taylor’s series for ln x with a = 1 is
$(x - 1) - (x - 1)^2/2 + (x - 1)^3/3 - (x - 1)^4/4 + \cdots$. In this
case, $R_n(x) = [(x - 1)^n/n!]\,(-1)^{n+1}(n - 1)!/\xi^n$ for some $\xi$
between 1 and x. If x > 1, $|R_n(x)| < (x - 1)^n/n$, which approaches
zero as n becomes large if $x \le 2$. If x < 1, it is better to use the
second form of the remainder $R_n$ given above. This gives
$R_n(x) = (-1)^{n+1}[(x - 1)/\xi]\,[(x - \xi)/\xi]^{n-1}$. For
$0 < x < \xi < 1$, $|(x - \xi)/\xi| = (\xi - x)/\xi < (1 - x)$. Hence
$\lim_{n \to \infty} R_n(x) = 0$ if 0 < x < 1.
It has thus been shown that the Taylor’s series for ln x with a = 1
converges and has ln x as sum if $0 < x \le 2$. If $x \le 0$ or x > 2,
the series diverges.
As another application of power series, consider finding the solution
y = y(x) of the differential equation $y'' + xy = e^x$ for which y = 1
and y' = 1 when x = 0. If one substitutes $\sum_{n=0}^{\infty} x^n/n!$
for $e^x$ and $\sum_{n=0}^{\infty} a_n x^n$











24 MATHEMATICS MAGAZINE (Sept.-Oct. 


for y, the equation becomes
$[2a_2 - 1] + \sum_{n=1}^{\infty} x^n[(n + 2)(n + 1)a_{n+2} + a_{n-1} - 1/n!] = 0$.
This is equivalent to the equations $a_2 = 1/2$ and
$a_{n+2} = -a_{n-1}/[(n + 2)(n + 1)] + 1/(n + 2)!$, n = 1, 2, $\cdots$.
Noting that $a_0 = 1$ and $a_1 = 1$ because of the conditions y = 1 and
y' = 1 when x = 0, it is found that
$y = 1 + x + \frac{1}{2}x^2 - \frac{1}{24}x^4 - \frac{1}{60}x^5 + \frac{1}{720}x^6 + \cdots$.
It can be shown that for a large class of differential equations
this method will yield a series with a positive radius of convergence
which is a Taylor’s series expansion of the solution of the differential
equation.
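The coefficients of the series solution of $y'' + xy = e^x$ with y = 1 and y' = 1 at x = 0 follow mechanically from $a_0 = a_1 = 1$, $a_2 = 1/2$ and the recurrence $a_{n+2} = -a_{n-1}/[(n+2)(n+1)] + 1/(n+2)!$. A minimal sketch in exact rational arithmetic (the function name is hypothetical):

```python
from fractions import Fraction
from math import factorial


def series_coefficients(count):
    """First `count` coefficients a_0, a_1, ... of the series for y."""
    a = [Fraction(1), Fraction(1), Fraction(1, 2)]  # a_0, a_1, a_2
    n = 1
    while len(a) < count:
        # a_{n+2} = -a_{n-1}/((n+2)(n+1)) + 1/(n+2)!
        a.append(-a[n - 1] / ((n + 2) * (n + 1)) + Fraction(1, factorial(n + 2)))
        n += 1
    return a[:count]


print(series_coefficients(7))
```

The coefficient of $x^3$ vanishes, and the remaining values agree with the terms $-x^4/24$, $-x^5/60$, $x^6/720$.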

5. Expansions in series of orthogonal functions. Let L be a set
of functions defined on an interval (a,b) with the properties that the
sum of two functions of L is a function of L and the product of a
function of L and a real number is a function of L. Such a set L is
an example of a linear vector space. If L is further restricted by
the assumption that $\|f\| = [\int_a^b (f)^2\,dt]^{1/2}$ exists for each
function f belonging to L, then it can be shown that the quantity
$(f,g) = \int_a^b fg\,dt$ exists for each pair f, g of members of L. If
one calls $\|f\|$ the length of f and (f,g) the inner product of f and g,
these concepts are very analogous to the same concepts for ordinary
vectors. In fact, suppose that two functions $u_1$ and $u_2$ are
orthogonal and let $f = a_1u_1 + a_2u_2$ and $g = b_1u_1 + b_2u_2$. Then
$\|f\|^2 = \int_a^b (a_1u_1 + a_2u_2)^2\,dt = a_1^2\|u_1\|^2 + a_2^2\|u_2\|^2$
and
$(f,g) = \int_a^b (a_1u_1 + a_2u_2)(b_1u_1 + b_2u_2)\,dt = a_1b_1\|u_1\|^2 + a_2b_2\|u_2\|^2$,
since $\int_a^b (u_1u_2)\,dt = 0$. If one normalizes $u_1$ and $u_2$ by
making $\|u_1\| = \|u_2\| = 1$, then with this length and inner product
the set of all finite linear combinations of $u_1$ and $u_2$ is
equivalent to the ordinary two-dimensional vector space with $u_1$ and
$u_2$ playing the role of unit perpendicular vectors.

If one has a sequence $u_1, u_2, u_3, \cdots$ of mutually orthogonal
functions of unit “length” (called an orthonormal sequence), it is
therefore natural to think of these as mutually perpendicular unit vectors
and to investigate the possibility of expressing a function as an
infinite series $\sum_{n=1}^{\infty} a_nu_n$, the coefficients $\{a_n\}$
being thought of as generalized components of the “vector” f. If $u_1$,
$u_2$, $u_3$ were three mutually perpendicular unit vectors and
$w = a_1u_1 + a_2u_2 + a_3u_3$, then, for n = 1, 2, or 3, the length of
$(w - ku_n)$ is least for $k = a_n$. Likewise, the length of
$w - (k_1u_1 + k_2u_2)$ is least for $k_1 = a_1$ and $k_2 = a_2$.
By analogy, if a function f is representable by an infinite series
$\sum_{n=1}^{\infty} a_nu_n$, one might expect the coefficients $\{a_n\}$
to be the numbers minimizing

$$\left\| f - \sum_{k=1}^{n} a_ku_k \right\|^2 = \int_a^b \Big(f - \sum_{k=1}^{n} a_ku_k\Big)^2 dt.$$

This is equivalent to minimizing $-2a_k\int_a^b (fu_k)\,dt + a_k^2$ for
each k, which gives $a_k = \int_a^b (fu_k)\,dt$ for each k. This
evaluation of the coefficients is


also obtained if it is assumed that $f = \sum_{k=1}^{\infty} a_ku_k$ and
that the convergence is such that one can integrate the series
$fu_n = \sum_{k=1}^{\infty} (a_ku_ku_n)$ term by term. Since
$\int_a^b u_ku_n\,dt = 0$ if $k \ne n$, and $\int_a^b (u_n)^2\,dt = 1$,
integration gives $\int_a^b (fu_n)\,dt = a_n$. With these values for the
coefficients $\{a_n\}$, it follows that
$\|f - \sum_{k=1}^{n} a_ku_k\|^2 = \int_a^b (f)^2\,dt - \sum_{k=1}^{n} a_k^2$
for each n and that $\sum_{k=1}^{n} a_k^2 \le \int_a^b (f)^2\,dt$ for
each n (Bessel’s inequality). Hence $\sum_{k=1}^{\infty} a_k^2$ is
convergent and $\sum_{k=1}^{\infty} a_k^2 \le \int_a^b (f)^2\,dt$. If
$\sum_{k=1}^{\infty} a_k^2 = \int_a^b (f)^2\,dt$, then
$\lim_{n \to \infty} \int_a^b (f - \sum_{k=1}^{n} a_ku_k)^2\,dt = 0$ and
$\sum_{k=1}^{\infty} a_ku_k$ is said to converge in the mean (of order
two) to f(x). If $\sum_{k=1}^{\infty} a_k^2 = \int_a^b (f)^2\,dt$ for
each continuous function f when $a_k = \int_a^b (fu_k)\,dt$, the sequence
$u_1, u_2, \cdots$ is said to be complete (this equality is then also
satisfied for any Lebesgue measurable function whose square is Lebesgue
integrable). It also follows that if $u_1, u_2, \cdots$ is a complete
orthonormal sequence and $a_n = \int_a^b (fu_n)\,dt = 0$ for each n, then
$\int_a^b (f)^2\,dt = 0$. If f is continuous, this implies f = 0. In
general, it means that f = 0 at all except a set of points of measure
zero. Conversely, it can be shown that $u_1, u_2, \cdots$ is complete if
$\int_a^b (fu_n)\,dt = 0$ for each n implies f = 0 at all except a set
of points of measure zero, whenever f is a Lebesgue measurable function
whose square is Lebesgue integrable. In other words, the sequence is
complete unless there is a function, not zero almost everywhere, which is
orthogonal to each member of the sequence.
6. Fourier series. The functions 1, cos x, sin x, cos 2x, sin 2x,
cos 3x, $\cdots$ are an orthogonal sequence on any interval
$(a, a + 2\pi)$. In other words,
$\int_a^{a+2\pi} \cos mx \sin nx\,dx = 0$ for all integers m and n, and
$\int_a^{a+2\pi} \cos mx \cos nx\,dx = \int_a^{a+2\pi} \sin mx \sin nx\,dx = 0$
if $m \ne n$. This sequence can also be shown to be complete, so that
for any square integrable function f(x) there is an infinite series
$a_0/2 + \sum_{n=1}^{\infty} (a_n \cos nx + b_n \sin nx)$ which converges
in the mean to f(x) on the interval $(a, a + 2\pi)$. Since
$\int_a^{a+2\pi} \cos^2 nx\,dx = \int_a^{a+2\pi} \sin^2 nx\,dx = \pi$,
the sequence $1/\sqrt{2\pi}$, $(\cos x)/\sqrt{\pi}$,
$(\sin x)/\sqrt{\pi}$, $(\cos 2x)/\sqrt{\pi}$, $\cdots$ is an orthonormal
sequence and the coefficients in the expansion of f(x) can be evaluated
as discussed above. This gives:

$$a_n = \frac{1}{\pi}\int_a^{a+2\pi} f(x)\cos nx\,dx, \quad \text{if } n \ge 0;$$

$$b_n = \frac{1}{\pi}\int_a^{a+2\pi} f(x)\sin nx\,dx, \quad \text{if } n \ge 1.$$

These formulas can also be obtained by multiplying
$f(x) = a_0/2 + \sum_{n=1}^{\infty} (a_n \cos nx + b_n \sin nx)$ by
cos nx or sin nx and integrating term by term over the interval
$(a, a + 2\pi)$. A series
$a_0/2 + \sum_{n=1}^{\infty} (a_n \cos nx + b_n \sin nx)$
is called a Fourier series if there is a function f(x) for which the
coefficients $\{a_n\}$ and $\{b_n\}$ are given by the above formulas. If
Lebesgue integration is used, this can be shown to be the case if
$\sum_{n=1}^{\infty} (a_n^2 + b_n^2)$ is convergent. In this case, it is
also true that:

$$\tfrac{1}{2}\pi a_0^2 + \pi \sum_{n=1}^{\infty} (a_n^2 + b_n^2) = \int_a^{a+2\pi} (f)^2\,dt.$$


For an example of a Fourier series expansion, let f(x) be defined
on the interval $(-\pi, \pi)$ by f(x) = 0 when $-\pi < x \le 0$ and by
f(x) = x when $0 \le x < \pi$. Then:

$$a_0 = \frac{1}{\pi}\int_0^{\pi} x\,dx = \pi/2,$$

$$a_n = \frac{1}{\pi}\int_{-\pi}^{\pi} f(x)\cos nx\,dx = \frac{1}{\pi}\int_0^{\pi} x(\cos nx)\,dx = (\cos n\pi - 1)/(\pi n^2),$$

$$b_n = \frac{1}{\pi}\int_{-\pi}^{\pi} f(x)\sin nx\,dx = \frac{1}{\pi}\int_0^{\pi} x(\sin nx)\,dx = -(\cos n\pi)/n.$$

The Fourier series for f(x) is therefore

$$\frac{\pi}{4} + \Big[-\frac{2}{\pi}\cos x + \sin x\Big] - \frac{1}{2}\sin 2x + \Big[-\frac{2}{9\pi}\cos 3x + \frac{1}{3}\sin 3x\Big] - \frac{1}{4}\sin 4x + \cdots.$$

This series converges to f(x) for each x satisfying $-\pi < x < \pi$, and
converges to $\pi/2$ if $x = -\pi$ or $x = +\pi$. In the latter case, the
series becomes $\pi^2/8 = 1 + 1/3^2 + 1/5^2 + 1/7^2 + \cdots$.
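The behaviour of this expansion can be observed numerically. The sketch below sums the series for the function equal to 0 on $(-\pi, 0)$ and to x on $(0, \pi)$, using $a_0 = \pi/2$, $a_n = (\cos n\pi - 1)/(\pi n^2)$ and $b_n = -(\cos n\pi)/n$; the function name and the number of terms are arbitrary choices for illustration.

```python
import math


def fourier_partial_sum(x, terms):
    """a_0/2 plus the first `terms` cosine and sine terms of the expansion."""
    total = math.pi / 4  # a_0 / 2
    for n in range(1, terms + 1):
        a_n = (math.cos(n * math.pi) - 1) / (math.pi * n**2)
        b_n = -math.cos(n * math.pi) / n
        total += a_n * math.cos(n * x) + b_n * math.sin(n * x)
    return total


for x in (-1.0, 1.0, math.pi):
    # expected limits: 0 at x = -1, 1 at x = 1, pi/2 at the end point
    print(x, fourier_partial_sum(x, 5000))
```

At the end point the partial sums approach $\pi/2$, the mean of the one-sided limits of the periodic extension, rather than the value $\pi$.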
Unfortunately, the Fourier series of a function may converge in 
the mean to the function, even though for particular values of x the 
series may not have a sum equal to the value of the function at x. 
Point-wise convergence must be established by essentially different 
methods than convergence in the mean. For this purpose, the concepts 
of right-hand and left-hand derivatives and of bounded variation are
very useful. The right-hand and left-hand derivatives of f(x) at
$x = x_0$ are defined, respectively, as
$\lim_{h \to 0} [f(x_0 + h) - f(x_0 + 0)]/h$ and
$\lim_{h \to 0} [f(x_0 - h) - f(x_0 - 0)]/(-h)$, if these exist under
the restriction that h is positive. The symbols $f(x_0 + 0)$ and
$f(x_0 - 0)$ are used for $\lim_{h \to 0} f(x_0 + h)$ and
$\lim_{h \to 0} f(x_0 - h)$, if these exist under the restriction
that h is positive. A function f(x) is of bounded variation in an
interval (a,b) if there is a number M such that for all increasing
finite sequences of numbers $x_0 = a, x_1, x_2, \cdots, x_n = b$ it is
true that

$$\sum_{k=1}^{n} |f(x_k) - f(x_{k-1})| < M.$$


With the above values of the coefficients, the Fourier series for 
f(x) can be written as follows: 


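$$\frac{1}{2\pi}\int_a^{a+2\pi} f(t)\,dt + \frac{1}{\pi}\sum_{n=1}^{\infty}\Big[\cos nx \int_a^{a+2\pi} f(t)\cos nt\,dt + \sin nx \int_a^{a+2\pi} f(t)\sin nt\,dt\Big].$$

By use of trigonometric identities, the sum of the first n + 1 terms of
this series can be written as:

$$S_n(x) = \frac{1}{\pi}\int_a^{a+2\pi} f(t)\Big\{\tfrac{1}{2} + \sum_{k=1}^{n} \cos[k(t - x)]\Big\}\,dt = \frac{1}{\pi}\int_a^{a+2\pi} f(t)\,\frac{\sin[(n + \tfrac{1}{2})(t - x)]}{2\sin[\tfrac{1}{2}(t - x)]}\,dt.$$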
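The trigonometric identity behind the closed form of $S_n(x)$ is $\tfrac{1}{2} + \sum_{k=1}^{n} \cos k\theta = \sin[(n + \tfrac{1}{2})\theta] / (2 \sin \tfrac{1}{2}\theta)$, with $\theta = t - x$. A quick numerical check, with hypothetical function names and arbitrarily chosen sample values of $\theta$ (none a multiple of $2\pi$):

```python
import math


def kernel_sum(theta, n):
    """1/2 + cos(theta) + cos(2 theta) + ... + cos(n theta)."""
    return 0.5 + sum(math.cos(k * theta) for k in range(1, n + 1))


def kernel_closed(theta, n):
    """sin((n + 1/2) theta) / (2 sin(theta / 2)); theta not a multiple of 2 pi."""
    return math.sin((n + 0.5) * theta) / (2 * math.sin(theta / 2))


for theta in (0.3, 1.0, 2.5, -1.7):
    for n in (1, 5, 40):
        print(theta, n, kernel_sum(theta, n), kernel_closed(theta, n))
```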


Let f(x) be defined, or redefined, outside the interval $(a, a + 2\pi)$
so that $f(x + 2\pi) = f(x)$ for all x. Then if
$\int_a^{a+2\pi} f(t)\,dt$ exists and, if this is an improper integral,
$\int_a^{a+2\pi} |f(t)|\,dt$ exists, then $\lim_{n \to \infty} S_n(x_0)$
can be shown to depend only on the nature of f(x) for x near $x_0$.
Furthermore, it can be shown that $\lim_{n \to \infty} S_n(x_0)$ (the sum
of the Fourier series) exists and equals
$\tfrac{1}{2}[f(x_0 + 0) + f(x_0 - 0)]$ if any one of the
following conditions is also satisfied:

i) $x_0$ is an interior point of some interval in which f(x) is of
bounded variation.

ii) The right-hand and left-hand derivatives of f(x) exist at $x_0$.

iii) $\int_0^h |[f(x_0 + t) - f(x_0 + 0) + f(x_0 - t) - f(x_0 - 0)]/t|\,dt$
exists for some positive number h.

A convergent Fourier series does not necessarily converge uniformly. 
However, if f(x) is continuous and of bounded variation in an interval 
(a,b) and $0 < \epsilon < b - a$, then the Fourier series for f(x) will
converge uniformly to the sum f(x) in the interval
$(a + \epsilon, b - \epsilon)$.

Trigonometric expansions of a function on an arbitrary interval 
can be obtained from a Fourier series by a change in variable. Thus 
for an interval (a,a + p): 


$$f(x) = \tfrac{1}{2}a_0 + \sum_{k=1}^{\infty} [a_k \cos(2\pi kx/p) + b_k \sin(2\pi kx/p)],$$

where $a_n = \frac{2}{p}\int_a^{a+p} f(t)\cos(2n\pi t/p)\,dt$ and
$b_n = \frac{2}{p}\int_a^{a+p} f(t)\sin(2n\pi t/p)\,dt$.

Let f(x) be defined on an interval (0,L) and be extended to the
interval (-L,0) by f(x) = f(-x), f(x) then being an even function on
(-L,L). Letting a = -L and p = 2L in the above then gives the Fourier
cosine series:

$$f(x) = \tfrac{1}{2}a_0 + \sum_{k=1}^{\infty} a_k \cos\frac{k\pi x}{L}, \quad \text{where } a_k = \frac{2}{L}\int_0^L f(t)\cos\frac{k\pi t}{L}\,dt.$$

Similarly, if f(x) is extended by f(-x) = -f(x) so as to be an odd
function, one gets the Fourier sine series:

$$f(x) = \sum_{k=1}^{\infty} b_k \sin\frac{k\pi x}{L}, \quad \text{where } b_k = \frac{2}{L}\int_0^L f(t)\sin\frac{k\pi t}{L}\,dt.$$


7. Some applications to the solution of boundary value problems. 
Consider the problem of finding the vertical displacement y(x,t) of 
an elastic string of length L stretched between the points (0,0) and 
(L,0) if at time t = 0 it is displaced into a position y = f(x) and 
released. Under certain restrictions and for a suitable constant a, 
y(x,t) will satisfy the partial differential equation
$\partial^2 y/\partial t^2 = a^2(\partial^2 y/\partial x^2)$. The
following boundary conditions must also be satisfied:
y(0,t) = y(L,t) = 0 for all $t \ge 0$, y(x,0) = f(x) for
$0 \le x \le L$, and $\partial y(x,0)/\partial t = 0$ for 0 < x < L. The
function $\sin(n\pi x/L)\cos(n\pi at/L)$ satisfies the partial
differential equation. If n is an integer, it satisfies all of the
boundary conditions except y(x,0) = f(x). The same is also true for any
finite sum $\sum_{k=1}^{n} c_k \sin(k\pi x/L)\cos(k\pi at/L)$.
It might then seem reasonable to try to determine constants $\{c_k\}$ for











1952) INFINITE SERIES AND TAYLOR AND FOURIER EXPANSIONS 29 


which $y = \sum_{k=1}^{\infty} c_k \sin(k\pi x/L)\cos(k\pi at/L)$ is the
desired solution. The remaining boundary condition then becomes
$f(x) = \sum_{k=1}^{\infty} c_k \sin(k\pi x/L)$ for $0 \le x \le L$. If
$c_k = \frac{2}{L}\int_0^L f(t)\sin(k\pi t/L)\,dt$, this is the Fourier
sine series for f(x) in the interval (0,L). With suitable restrictions
on f(x), it can be shown that, with these values for the coefficients
$\{c_k\}$, $y = \sum_{k=1}^{\infty} c_k \sin(k\pi x/L)\cos(k\pi at/L)$ is
the desired solution.


Let us next consider the problem of finding the steady-state temper- 
ature distribution of a circular plate whose circumference is kept 
at a prescribed temperature and whose faces are insulated. If polar 
coordinates are used, Laplace’s equation (which the temperature
$U(r,\theta)$ must satisfy) has the form:

$$\frac{\partial^2 U}{\partial r^2} + \frac{1}{r^2}\frac{\partial^2 U}{\partial \theta^2} + \frac{1}{r}\frac{\partial U}{\partial r} = 0.$$

If one substitutes $U = f(r)g(\theta)$, it is found by separating
variables in the resulting equation that
$(c_1r^k + c_2r^{-k})(c_3\cos k\theta + c_4\sin k\theta)$
is a solution of the equation for arbitrary values of $c_1$, $c_2$,
$c_3$, $c_4$, and k. If one sets $c_2 = 0$ so the solution will be
continuous inside the circle, this can be written in the form
$c_kr^k\cos k\theta + d_kr^k\sin k\theta$.
For suitable choice of the coefficients, the series
$U(r,\theta) = \tfrac{1}{2}c_0 + \sum_{k=1}^{\infty} (c_kr^k\cos k\theta + d_kr^k\sin k\theta)$
will be the desired solution. If the circle is of radius $\rho$ and the
temperature on the boundary of the circle is given by
$U(\rho,\theta) = f(\theta)$, we must have
$f(\theta) = \tfrac{1}{2}c_0 + \sum_{k=1}^{\infty} (c_k\rho^k\cos k\theta + d_k\rho^k\sin k\theta)$.
If this is to be the Fourier series for $f(\theta)$, then

$$c_k = \frac{1}{\pi\rho^k}\int_0^{2\pi} f(\theta)\cos k\theta\,d\theta \quad \text{and} \quad d_k = \frac{1}{\pi\rho^k}\int_0^{2\pi} f(\theta)\sin k\theta\,d\theta.$$


8. Legendre polynomials. It seems appropriate to discuss at least
one of several sequences of orthogonal functions which are of great
value in applied mathematics. The Legendre polynomials are a sequence
of polynomials $P_0 = 1$, $P_1 = x$, $P_2 = \tfrac{1}{2}(3x^2 - 1)$,
$\cdots$ such that $P_n(x)$ is of degree n and
$\int_{-1}^{1} P_m(x)P_n(x)\,dx = 0$ if $m \ne n$. This determines the
polynomials except for multiplicative constants. More explicitly,

$$P_n(x) = \frac{1}{2^n n!}\frac{d^n (x^2 - 1)^n}{dx^n} = \sum_{k=0}^{m} (-1)^k \frac{(2n - 2k)!}{2^n\,k!\,(n - k)!\,(n - 2k)!}\,x^{n-2k},$$

where m = n/2 or (n - 1)/2, whichever is an integer.
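The explicit sum for $P_n(x)$, with general term $(-1)^k (2n-2k)! / [2^n\,k!\,(n-k)!\,(n-2k)!]\;x^{n-2k}$ for k from 0 to m, can be evaluated in exact arithmetic and the orthogonality on (-1,1) verified directly. The sketch below is illustrative only; the function names are hypothetical.

```python
from fractions import Fraction
from math import factorial


def legendre_coeffs(n):
    """coeffs[j] is the coefficient of x^j in P_n, from the explicit sum."""
    coeffs = [Fraction(0)] * (n + 1)
    for k in range(n // 2 + 1):  # n // 2 is the integer m of the formula
        num = (-1) ** k * factorial(2 * n - 2 * k)
        den = 2**n * factorial(k) * factorial(n - k) * factorial(n - 2 * k)
        coeffs[n - 2 * k] = Fraction(num, den)
    return coeffs


def integrate_product(p, q):
    """Exact integral over (-1,1) of the product of two polynomials."""
    total = Fraction(0)
    for i, ci in enumerate(p):
        for j, cj in enumerate(q):
            if (i + j) % 2 == 0:  # odd powers integrate to zero
                total += ci * cj * Fraction(2, i + j + 1)
    return total


P2, P3 = legendre_coeffs(2), legendre_coeffs(3)
print(P2, P3, integrate_product(P2, P3), integrate_product(P3, P3))
```

The last value printed is 2/7, in agreement with the known normalization $\int_{-1}^{1} [P_n(x)]^2\,dx = 1/(n + \tfrac{1}{2})$ for n = 3.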
The Legendre polynomials form a complete orthogonal sequence on
the interval (-1,1). The expansion of a function in a series of Legendre
polynomials therefore converges in the mean on the interval (-1,1)
to the function and has the form
$f(x) = \sum_{n=0}^{\infty} A_nP_n(x)$, where
$A_n\int_{-1}^{1} [P_n(x)]^2\,dx = \int_{-1}^{1} f(x)P_n(x)\,dx$. It can
be shown that $\int_{-1}^{1} [P_n(x)]^2\,dx = 1/(n + \tfrac{1}{2})$,
so this reduces to

$$A_n = (n + \tfrac{1}{2})\int_{-1}^{1} f(x)P_n(x)\,dx.$$

Sufficient conditions for point-wise convergence are more restrictive
than for convergence in the mean, but are very similar to those given
for Fourier series. In fact, if $\int_{-1}^{1} (1 - t^2)^{-1/4} f(t)\,dt$
and $\int_{-1}^{1} |(1 - t^2)^{-1/4} f(t)|\,dt$ exist and
$-1 < x_0 < 1$, then $\sum_{n=0}^{\infty} A_nP_n(x_0)$ will converge and
have the sum $\tfrac{1}{2}[f(x_0 + 0) + f(x_0 - 0)]$ if any one of the
conditions i, ii, or iii given for the convergence of a Fourier series is
satisfied.

Any solution of Laplace’s equation is called a harmonic function.
Legendre’s polynomials are called zonal harmonics because of their
use in such problems as the following, where U is constant on boundaries
of zones of a spherical surface: Let U be the steady state temperature
of a sphere whose surface is kept at a prescribed temperature, or
let U be the electric potential in a sphere whose surface is kept at
a fixed potential and whose interior is free of charges. If U is assumed
to be symmetrical about the z-axis and spherical coordinates
$(r, \theta, \phi)$ are used with the origin at the center of the sphere
and $\phi$ the angle between the z-axis and the radius vector, then U
will be independent of $\theta$ and Laplace’s equation (which U must
satisfy) reduces to:

$$\frac{1}{r^2}\frac{\partial}{\partial r}\Big(r^2\frac{\partial U}{\partial r}\Big) + \frac{1}{r^2\sin\phi}\frac{\partial}{\partial \phi}\Big(\sin\phi\,\frac{\partial U}{\partial \phi}\Big) = 0.$$

It can be verified that $r^nP_n(\cos\phi)$ is a solution of this equation
for n = 0, 1, 2, $\cdots$. If the radius of the sphere is $\rho$ and
$U = f(\phi)$ on the surface of the sphere, then U will be equal to
$\sum_{n=0}^{\infty} A_nr^nP_n(\cos\phi)$ if the coefficients $\{A_n\}$
are determined so that
$f(\phi) = \sum_{n=0}^{\infty} A_n\rho^nP_n(\cos\phi)$.
Writing $f(\phi) = F(\cos\phi)$, it is seen that
$A_n\rho^n = (n + \tfrac{1}{2})\int_{-1}^{1} F(x)P_n(x)\,dx = (n + \tfrac{1}{2})\int_0^{\pi} f(\phi)P_n(\cos\phi)\sin\phi\,d\phi$.
9. Other convergence processes. The following two infinite
processes will be discussed briefly in an attempt to widen the reader’s
concept of the variety of such processes, other examples of which are
improper integrals, double and multiple series, continued fractions,
infinite determinants and matrices, and asymptotic series.
An infinite product is an indicated product
$(1 + a_1)(1 + a_2)(1 + a_3) \cdots$ of an infinite number of factors.
It is said to converge if
$\lim_{n \to \infty} (1 + a_1)(1 + a_2) \cdots (1 + a_n)$ exists and is
not zero. If $a_n \ne -1$ for any n, then a sufficient condition for
convergence is that $\sum_{n=1}^{\infty} |a_n|$ converge. If
$0 \le a_n$ for each n, or $-1 < a_n \le 0$ for each n, this is also a
necessary condition for convergence. If $\sum_{n=1}^{\infty} |a_n|$
converges, the product is said to converge absolutely. A very useful and
interesting infinite product is:

$$\sin x = x[1 - (x/\pi)^2]\,[1 - (x/2\pi)^2]\,[1 - (x/3\pi)^2] \cdots.$$
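The convergence of the product $x[1 - (x/\pi)^2][1 - (x/2\pi)^2] \cdots$ to sin x is slow but easy to observe. In this sketch the function name and the numbers of factors are arbitrary choices for illustration.

```python
import math


def sine_product(x, factors):
    """x * product over n = 1..factors of (1 - (x / (n pi))^2)."""
    value = x
    for n in range(1, factors + 1):
        value *= 1 - (x / (n * math.pi)) ** 2
    return value


for factors in (10, 100, 1000, 20000):
    # partial product at x = 1 next to the value of sin 1
    print(factors, sine_product(1.0, factors), math.sin(1.0))
```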


There are various ways of assigning a “sum” to certain divergent
series. If this is possible, the series is said to be summable relative
to that particular method. A basic requirement of all such methods
is that when applied to a convergent series they give the usual sum.
The following is known as Cesàro’s method of summation. Let
$a_1 + a_2 + \cdots$ be an infinite series and
$s_n = a_1 + a_2 + \cdots + a_n$. If
$\lim_{n \to \infty} (s_1 + s_2 + \cdots + s_n)/n = S$ exists, then
$\sum_{n=1}^{\infty} a_n$ is said to be summable (C1) and
its sum (C1) is S. One of the most interesting applications of this
method is to Fourier series. It can be shown that if f(x) is periodic
of period $2\pi$ and $\int_0^{2\pi} f(t)\,dt$ exists and, if this is an
improper integral, $\int_0^{2\pi} |f(t)|\,dt$ exists, then the Fourier
series for f(x) is summable (C1) at all points $x_0$ for which
$f(x_0 + 0)$ and $f(x_0 - 0)$ exist. Also, its
sum (C1) is $\tfrac{1}{2}[f(x_0 + 0) + f(x_0 - 0)]$.
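A standard illustration of (C1) summation, not taken from the article, is the divergent series 1 - 1 + 1 - 1 + ..., whose partial sums alternate between 1 and 0 while their averages $(s_1 + \cdots + s_n)/n$ tend to 1/2. The sketch below also applies the method to a convergent geometric series to show that the usual sum is recovered; the function name is hypothetical.

```python
from fractions import Fraction


def cesaro_means(terms):
    """Return the averages (s_1 + ... + s_n)/n of the partial sums s_n."""
    means = []
    partial = Fraction(0)   # s_n
    running = Fraction(0)   # s_1 + ... + s_n
    for n, a in enumerate(terms, start=1):
        partial += a
        running += partial
        means.append(running / n)
    return means


alternating = [(-1) ** k for k in range(1000)]       # 1 - 1 + 1 - ...
geometric = [Fraction(1, 2**k) for k in range(1000)]  # converges to 2
print(cesaro_means(alternating)[-1], float(cesaro_means(geometric)[-1]))
```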


Haverford College
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Graduate training not required for reading 


SPHERES ASSOCIATED WITH A TETRAHEDRON 
Victor Thebault 


1. Let ABCD be a tetrahedron whose edges BC and DA, CA and
DB, AB and DC have lengths $a$ and $a'$, $b$ and $b'$, $c$ and $c'$. We
extend the edges (DA, DB, DC), ..., from each vertex by k times their
lengths so that the twelve points obtained $(D'_a, D'_b, D'_c)$,
$(A'_b, A'_c, A'_d)$, ..., are such that $AD'_a = k\,DA$,
$BD'_b = k\,DB$, $CD'_c = k\,DC$, ....

It is proposed to evaluate the sum of the powers of each of the
points $(D'_a, D'_b, D'_c)$, ..., with respect to the sphere described on
the opposite edge as a diameter.

The sum of the powers $P(D'_a)$, $P(D'_b)$, $P(D'_c)$ of points $D'_a$,
$D'_b$, $D'_c$ with respect to the spheres described on the edges BC, CA,
AB as diameters is equal to

$$D'_aA_1^2 - BC^2/4 + D'_bB_1^2 - CA^2/4 + D'_cC_1^2 - AB^2/4$$

where $A_1$, $B_1$, $C_1$ are the midpoints of BC, CA, AB.
If we apply Stewart’s theorem to triangles $DA_1D'_a$, $DB_1D'_b$,
$DC_1D'_c$ to evaluate $D'_aA_1^2$, $D'_bB_1^2$, $D'_cC_1^2$, we obtain

$$P(D'_a) = (k+1)(b^2 + c^2)/2 + k(k+1)a'^2 - k(b'^2 + c'^2)/2 - a^2/2,$$
$$P(D'_b) = (k+1)(c^2 + a^2)/2 + k(k+1)b'^2 - k(c'^2 + a'^2)/2 - b^2/2,$$
$$P(D'_c) = (k+1)(a^2 + b^2)/2 + k(k+1)c'^2 - k(a'^2 + b'^2)/2 - c^2/2.$$

By suitable permutations we find analogous formulas for the powers
of the nine other points $(A'_b, A'_c, A'_d)$, ..., with respect to the
spheres described on the opposite edges as diameters.

The sum of the powers of the twelve points in question is equal to

$$\sum P(D'_a) = (2k^2 + 2k + 1) \sum (a^2 + a'^2).$$


“ 


2. If we consider the twelve points $(D''_a, D''_b, D''_c)$, ..., situated
on the edges (DA, DB, DC), ..., such that

$$DD''_a = k \cdot DA, \quad DD''_b = k \cdot DB, \quad DD''_c = k \cdot DC, \ \ldots,$$

we obtain in the same manner

$$P(D''_a) = k(b^2 + c^2)/2 + k(k-1)a'^2 - (k-1)(b'^2 + c'^2)/2 - a^2/2, \ \ldots$$

and

$$\sum P(D''_a) = (2k^2 - 2k + 1) \sum (a^2 + a'^2).$$

In both these cases the sums of the powers in question are positive,
for

$$2k^2 + 2k + 1 = k^2 + (k+1)^2, \qquad 2k^2 - 2k + 1 = k^2 + (k-1)^2.$$
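The two sums can be checked numerically on any particular tetrahedron. The following is a minimal sketch under stated assumptions: the coordinates, the helper names, and the reading of the second family as the points at k times the edge length measured from the near vertex are choices made here for illustration. The power of a point with respect to the sphere on an edge PQ as diameter is taken as its squared distance to the midpoint of PQ minus PQ²/4.

```python
def sq_dist(p, q):
    """Squared distance between two points given as coordinate tuples."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def power_sum(vertices, k, beyond_far_vertex=True):
    """Sum of the powers of the twelve points of a tetrahedron.

    For each ordered pair of vertices (V, W) the point lies on line VW:
    at W + k(W - V) when beyond_far_vertex is True (first construction),
    at V + k(W - V) otherwise (second construction, as assumed here).
    Its power is taken with respect to the sphere whose diameter is the
    edge joining the two remaining vertices.
    """
    total = 0.0
    for i in range(4):
        for j in range(4):
            if i == j:
                continue
            v, w = vertices[i], vertices[j]
            p, q = (vertices[m] for m in range(4) if m not in (i, j))
            start = w if beyond_far_vertex else v
            point = tuple(s + k * (b - a) for s, a, b in zip(start, v, w))
            midpoint = tuple((a + b) / 2 for a, b in zip(p, q))
            total += sq_dist(point, midpoint) - sq_dist(p, q) / 4
    return total


def edge_square_sum(vertices):
    """Sum of the squares of the six edges."""
    return sum(sq_dist(vertices[i], vertices[j])
               for i in range(4) for j in range(i + 1, 4))


# Hypothetical tetrahedron and ratio, chosen only for illustration.
tetra = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (0.0, 2.0, 0.0), (0.5, 0.5, 3.0)]
k = 0.7
edges = edge_square_sum(tetra)
print(power_sum(tetra, k, True), (2 * k * k + 2 * k + 1) * edges)
print(power_sum(tetra, k, False), (2 * k * k - 2 * k + 1) * edges)
```

Each printed pair should agree up to rounding error; changing `tetra` or `k` gives further spot checks, which are of course not a proof.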
3. Special cases:

1) $k = -1$: $\sum P(D'_a) = \sum (a^2 + a'^2)$.

2) $k = 0$: $\sum P(D'_a) = \sum P(D''_a) = \sum (a^2 + a'^2)$.

THEOREM. In a tetrahedron the sum of the powers of the vertices
with respect to the spheres described on the edges of the opposite
faces as diameters is equal to the sum of the squares of the edges.

3) $k = 1$: $\sum P(D'_a) = 5 \sum (a^2 + a'^2)$, $\sum P(D''_a) = \sum (a^2 + a'^2)$.


An analogous question for the triangle has been discussed by C. A. 
Laisant, Nouvelle Correspondance Mathematique, Vol. 3, 1877, p. 394. 


Translated from the French by W. E. Byrne.

TEACHING OF MATHEMATICS
Edited by 
Joseph Seidlin and C. N. Shuster 


This department is devoted to the teaching of mathematics. Thus articles on 
methodology, exposition, curriculum, tests and measurements, and any other 
topic related to teaching, are invited. Papers on any subject in which you, as 
a teacher, are interested, or questions which you would like others to discuss, 
should be sent to Joseph Seidlin, Alfred University, Alfred, New York. 


A REFLECTIVE APPROACH IN THE TEACHING OF MATHEMATICS 
Charles A. Johnson 


The purpose of this article is to illustrate a reflective approach 
in the teaching of a topic in mathematics through the study of ideas 
in contrast or conflict. 

A word of explanation of the pedagogical principles involved is in 
order. The technique is essentially one of posing a provocative question 
at the outset, allowing the class time to consider possible solutions, 
and, finally, adopting a conclusion. The choice of the question is 
governed by its potentialities for throwing light on conflicts in 
ideas which reveal inadequacies or disharmonies in the outlook of the 
student. The question may seem to the class to concern something that 
they already thoroughly understand and will often be such as to evince 
a definite “yes” or “no” response. Then the follow-up question comes: 
“How do you know?” It is essential to the method that questions come
first and conclusions last. Between the question and the conclusion
there must be ample time for the class to reflect upon alternative
hypotheses which they have formulated in attempting to answer the
question. Fundamentally, it is the application of the scientific
method in the classroom consideration of problems. It is the antithesis
of the “telling” method of teaching.

I shall illustrate the technique by outlining the procedure which 
might be followed in presenting Maclaurin’s Series. 

It is assumed that what follows has been immediately preceded
by a study of power series and their properties as well as the ideas
of convergence and divergence of series.

We may begin by considering the following relations: 


$$1/(1 - x) = 1 + x + x^2 + x^3 + x^4 + \cdots$$
$$(1 - 2)^{-1} = 1 + 2 + 4 + 8 + \cdots$$
$$(1 + x)^{1/2} = 1 + x/2 - x^2/(2^2\,2!) + 3x^3/(2^3\,3!) - \cdots$$

The first of these may be obtained by a simple process of division
while the second and third may be presented as applications of the
Binomial Theorem. This is the approach suggested in Middlemiss’
“Differential and Integral Calculus”.

Now the class is confronted with this question: Are these relations 
valid? Of course, the student may have a definitely apathetic attitude 
toward the question because he does not immediately appreciate the 
consequences of his answer. A student in one of my classes recently 
reacted submissively with the comment, “Math. is math.” Many, in fact,
will see no sense at all to the question. But if they are challenged
to substantiate a “yes” or “no” response the discussion will be likely
to proceed as follows:

“They were obtained by legitimate processes, so it seems to me
that they should be valid. But the second one is obviously an
absurdity.”

“Yes, and if you put x equal to 4 in the first one, you get another
absurd result.”

“But if you put x equal to ½ in the first one you obtain a relation
which we know to be true.”

The class now appreciates the conflict because it realizes that
the relations under consideration are both “true” and “false” depending
on the choice of value for x. We proceed to a discussion of this
question: For what values of x do these relations seem to be valid?

“The first relation seems to hold for $|x| < 1$.”

What set of values does this represent?

“The interval of convergence of the power series.”

Now we proceed with a discussion of the third relation and show 
that the values represented by the interval of convergence of the 
power series in it define the values for which it is a valid relation. 
We conclude that a power series can represent certain algebraic func- 
tions provided that we limit it to these values. 
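The conflict the class meets in the first relation can be made concrete by computing partial sums. This is a minimal sketch; the function name and the particular numbers of terms are arbitrary choices made here. Inside the interval of convergence the partial sums settle on 1/(1 - x); outside it they run away from that value.

```python
def geometric_partial_sum(x, n_terms):
    """Sum of the first n_terms terms of 1 + x + x**2 + x**3 + ..."""
    return sum(x ** k for k in range(n_terms))


for x in (0.5, 4.0):
    closed_form = 1 / (1 - x)
    partial_sums = [geometric_partial_sum(x, n) for n in (5, 10, 20)]
    print(f"x = {x}: 1/(1 - x) = {closed_form}, partial sums = {partial_sums}")
```

For x = ½ the partial sums approach 2; for x = 4 they grow without bound while 1/(1 - x) is the negative number -1/3, which is the “absurd result” of the dialogue.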

We have seen that an algebraic function can be represented by a
power series and that leads us to the consideration of this question:
Can functions such as $e^x$, sin x, log x, etc. be expressed as power
series? Again the student may not immediately experience a conflict
but when he sees that a power series is an algebraic expression in
x and was used to represent another algebraic expression, he may begin
to feel that perhaps a transcendental function, which involves
operations “transcending” the laws of algebra, cannot be represented by
a power series.

To answer this question we recall the definition of $e^x$ as
$\lim_{n \to \infty} (1 + x/n)^n$. Can we use a power series to express this relation?

“$(1 + x/n)^n$ can be expanded by use of the Binomial Theorem to
give: $1 + x + n(n - 1)x^2/2!\,n^2 + \cdots$. Or, after taking the limit,

$$e^x = 1 + x + x^2/2! + \cdots.$$”
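The passage from the binomial expansion to the series can be illustrated numerically. The sketch below uses hypothetical function names and compares the limit definition with partial sums of the series; it also shows the coefficient of x² in the binomial expansion, n(n - 1)/(2! n²), approaching 1/2! as n grows.

```python
import math


def limit_form(x, n):
    """(1 + x/n)**n, which tends to e**x as n grows."""
    return (1 + x / n) ** n


def series_form(x, n_terms):
    """Partial sum 1 + x + x**2/2! + ... with n_terms terms."""
    return sum(x ** k / math.factorial(k) for k in range(n_terms))


def second_coefficient(n):
    """Coefficient of x**2 in the binomial expansion of (1 + x/n)**n."""
    return n * (n - 1) / (math.factorial(2) * n ** 2)


x = 1.0
for n in (10, 1000, 100000):
    print(n, limit_form(x, n), second_coefficient(n))
print(series_form(x, 20), math.exp(x))
```

The limit form converges slowly, while a few terms of the series already agree with `math.exp` to many places.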
Now we have seen that functions other than those which are algebraic
can be expressed as power series. Can we expand a trigonometric function
as a power series? That is, is there a relation such as:
$\sin x = a_0 + a_1 x + a_2 x^2 + a_3 x^3 + \cdots$?

“Maybe there is but how would we get the values of the a’s?”

Is there a way that we could obtain $a_0$?

“Let x equal 0.”

This evaluates the first constant and we then have
$\sin x = a_1 x + a_2 x^2 + a_3 x^3 + \cdots$. Now how would we get $a_1$?

“If the right-hand side of the relation began with $a_1$, we would
be able to repeat the process that we used in getting $a_0$. If we divide
by x it would seem to do it.”

If we do that we will have $(\sin x)/x = a_1 + a_2 x + a_3 x^2 + \cdots$.
Having used this to evaluate $a_1$, we proceed to the next one.

Now how will we obtain $a_2$?

“We could divide by x again.”

This gives: $(\sin x)/x^2 = 1/x + a_2 + a_3 x + a_4 x^2 + \cdots$. Can we use
this relation to evaluate $a_2$?

“No, because of the term 1/x.”

It seems to me that the procedure for arriving at the expansion 
for sin x as outlined above is most likely to evolve although most 
textbooks assume that we would immediately proceed to use differ- 
entiation. In fact, I outlined this procedure before using it in the 
classroom and subsequently found that my classes actually did follow 
this line of reasoning. That is why I followed out the method of
division although, as we have seen, it did lead the class up a blind
alley. If we proceed this way, however, we are not forcing things to 
go according to a pre-conceived plan; we are letting the ideas of the 
class come out as they will so that, in the end, they will have a 
deeper understanding because they will know what it is as well as 
what it is not. They also get the experience of rejecting a hypothesis 
because it does not work. 

Let us go on with the development of sin x. Now, when we were trying
to evaluate $a_1$ we made the right-hand side start with $a_1$ by dividing
$a_1 x$ by x. Could we have applied any other process to this term that
would have made it start with $a_1$?

“We could differentiate it.”

This leads to further questions but the procedure follows smoothly
from this point on and we proceed to obtain the value of $a_2$ and then
go on to derive the complete development for sin x and also the
generalized form for any Maclaurin Series.
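Carried through, the differentiation step gives the coefficient of $x^n$ as the n-th derivative at 0 divided by n!. The following sketch, with hypothetical names, applies this to sin x, whose successive derivatives at 0 repeat the values 0, 1, 0, -1.

```python
import math

# sin 0, cos 0, -sin 0, -cos 0: the derivatives of sin at 0 repeat with period 4.
DERIVATIVES_AT_ZERO = (0, 1, 0, -1)


def maclaurin_coefficient(n):
    """a_n = (n-th derivative of sin at 0) / n!"""
    return DERIVATIVES_AT_ZERO[n % 4] / math.factorial(n)


def sin_series(x, n_terms=20):
    """Partial sum a_0 + a_1 x + ... + a_(n_terms - 1) x**(n_terms - 1)."""
    return sum(maclaurin_coefficient(n) * x ** n for n in range(n_terms))


print([maclaurin_coefficient(n) for n in range(6)])
print(sin_series(1.0), math.sin(1.0))
```

The first line shows $a_0 = 0$ and $a_1 = 1$, the values reached in the classroom dialogue by setting x = 0 and by dividing by x, followed by $a_2 = 0$ and $a_3 = -1/3!$.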

The reader can see how the question as to whether or not we can 
use this method for any function will lead immediately into a discussion 
of Taylor’s Series. 

Assuming that this method is valuable in the teaching of mathematics,
the question might well be asked: Can we apply this method
to the teaching of all topics in mathematics?

It seems to me that every topic worthy of inclusion in the mathematics
curriculum should represent a real problem for the learner;
that is, it should be one which results from inadequacies and/or
disharmonies in the outlook of the learner. A study of ideas in contrast
or conflict seems to be very effective in revealing these inadequacies
and disharmonies and also provides impetus to the study.
When this study is carried forward reflectively, I contend that we
are then promoting the highest type of learning under conditions most
conducive to success, for we are not only making the teaching of how
to think central in our program but also creating a learning
situation charged with spontaneous inquisitiveness.

It is not the contention of the writer that this idea is basically
novel but merely that conflicts are inherent in every learning situation
and possess great potentialities for motivation. Teachers of mathematics
and writers of mathematics textbooks are urged to exploit these latent
possibilities as one means of improving the instructional program.


University of Missouri

CURRENT PAPERS AND BOOKS


Edited by


H. V. Craig


This department will present comments on papers previously published in the 
MATHEMATICS MAGAZINE, lists of new books, and book reviews. 

In order that errors may be corrected, results extended, and interesting 
aspects further illuminated, comments on published papers in all departments 
are invited. 

Communications intended for this department should be sent in duplicate
to H. V. Craig, Department of Applied Mathematics, University of Texas, Austin
12, Texas.


Introduction to Number Theory. By Trygve Nagell, John Wiley & Sons, 
New York 1951, 309 pages, $5.00. 


This textbook is indeed an excellent introduction to the subject. 
After an introductory chapter entitled “Divisibility”, there follow 
chapters on the distribution of primes (essentially Tchebychef’s 
results), congruences, quadratic residues and cyclotomy. Then comes a 
chapter on diophantine equations of degree two, and a chapter on 
diophantine equations of higher degree in two and three unknowns. This 
last topic is Nagell’s specialty, and is very well handled. The book 
concludes with the Erdos-van der Corput version of Selberg’s proof of 
the prime number theorem. 

A stimulating feature is the inclusion of many quite recent results.
As well as the “elementary” proof of the prime number theorem, Niven’s
proof of the irrationality of $\pi$ is included, and Mills’ prime-giving
formula is mentioned, though without proof. In addition, the definitive
results of Siegel and Weil on lattice points on algebraic curves are
described. There are several excellent collections of problems of all
degrees of difficulty.

No account of continued fractions is given, nor of the systematic
theory of quadratic forms. But Pell’s equation and Legendre’s equation
$ax^2 + by^2 + cz^2 = 0$ are thoroughly treated. Ample justice is done
throughout the book to the contributions of Scandinavian mathematicians
to the theory of numbers.


Morgan Ward 


Lectures in Abstract Algebra. By Nathan Jacobson, D. van Nostrand, 
New York, 1951, 217 pages. 


This book, the first of three projected volumes on the subject by 
Jacobson, is a very good general introduction to modern algebra. After
the necessary (mildly Bourbakian) preliminaries, there follow four 
chapters on semi-groups and groups; rings, integral domains and fields; 
extensions of rings and fields; elementary factorization theory. Then 
follows an interesting chapter on groups with operators, containing 


40 MATHEMATICS MAGAZINE (Sept. -Oct. 


among other things Schreier’s refinement theorem, the Krull-Schmidt
decomposition theorem, chain conditions, direct and sub-direct products.
The sixth chapter on modules and ideals gives the Hilbert basis theorem
and the Noether decomposition theorems for ideals of a commutative ring
with ascending chain condition.

A reader familiar with the classical treatment of modern algebra 
by van der Waerden, will see that Jacobson has compressed nearly all the 
group, ring and ideal theory in van der Waerden’s two volumes into less 
than two hundred pages. In addition, many more recent results are 
included, and the treatment is throughout easy and lucid. The important 
subject of linear algebra is omitted; it will be treated separately 
in a second volume. 

The book closes with a chapter on lattice theory but treats fully
only those parts directly applicable to group and ring theory, in line
with Ore’s development of the subject.

There are well-selected groups of exercises, remarkably few
misprints and no errors that the reviewer could detect.


Morgan Ward 


Mathematics of Investment. By Paul R. Rider and Carl H. Fischer.
New York, Rinehart and Company, Inc., 1951. XI +359 pages. $5.00. 


The content of the text and the order of the topics, with few
exceptions, seem to be that of the traditional text devoted exclusively
to the subject. Since the text begins with simple interest and simple
discount followed by compound interest and annuities, a working
knowledge on the part of the student of algebra including logarithms and
progressions is assumed.

Following the work on annuities, amortization and sinking funds, 
bonds, depreciation and capitalized cost, is extensive work in life 
annuities, life insurance net premiums, and life insurance reserves. 
There are over 1000 exercises in the text. It may be used for a
two-semester course or, of course, may be adapted for a one-semester course.

Considerable care seems to have been exercised in dividing the text 
into normal lesson units. The development of annuities and related 
topics is performed in a very excellent manner from the point of view 
of clarity and consistency. In this connection, this reviewer par- 
ticularly liked the aid of time diagrams. The cases of the general 
annuities have been simplified and unified. 

The authors have evidently devoted a lot of time to the selection 
of problems. This is well illustrated by the chapter on bonds where, 
as the authors say, “... the reader will not find the misuse of the
word ‘dividend’ for the periodic interest payment on a bond, nor will 
the examples give the erroneous impression that the typical bond matures 
for an amount higher than its par value.” Answers are provided only 
for the odd-numbered problems throughout the text.

This text has an arrangement of tables which the reviewer considers
an improvement over the conventional method of interest tables by
functions subdivided by interest rates. Table II, for example, is
arranged by functions and gives the values of $s^n$, $v^n$, $s_{\overline{n}|}$, $a_{\overline{n}|}$, and
$1/s_{\overline{n}|}$.

Another feature is the omission of tabular values for $1/a_{\overline{n}|}$ on the
supposition that first, this is an aid in giving a simple unifying
treatment of the general annuity case and second, it is easier to add
the interest rate to $1/s_{\overline{n}|}$ to find the function $1/a_{\overline{n}|}$ than it is to perform
the corresponding subtraction. Too, the lowest rate shown in the table
is 1% converted monthly.
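The remark about adding the interest rate rests on the standard identity 1/a = 1/s + i between the present value a and the accumulated value s of an annuity of 1 per period. The sketch below is an illustration under assumptions: the function names and the rate and term are chosen here, and the formulas are the usual textbook definitions rather than anything quoted from the book under review.

```python
def accumulated_value(i, n):
    """s: amount of an annuity of 1 per period for n periods at rate i."""
    return ((1 + i) ** n - 1) / i


def present_value(i, n):
    """a: present value of an annuity of 1 per period for n periods at rate i."""
    return (1 - (1 + i) ** (-n)) / i


i, n = 0.05, 10  # illustrative rate per period and number of periods
sinking_fund_factor = 1 / accumulated_value(i, n)
amortization_factor = 1 / present_value(i, n)
print(amortization_factor, sinking_fund_factor + i)
```

The two printed values agree up to rounding, so a table of 1/s alone suffices: a single addition of the rate recovers 1/a.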


W. H. Bradford 


Fourier Transforms. By Ian N. Sneddon, International Series in
Pure and Applied Mathematics, McGraw-Hill 1951, pp xii+542.


In recent years, integral transform methods have superseded the 
older and more intuitive operational procedures for the solution of 
ordinary and partial differential equations in Applied Mathematics. 
Most books on the subject emphasize the role of the Laplace transform, 
implying in most cases transformation of the equation with respect 
to the time variable. By contrast, the book under review is concerned
mainly with Fourier and Hankel transforms normally involving
transformation with respect to a space variable.

Following mathematical introductions to the various types of in- 
tegral transforms, including finite Fourier and Hankel transforms, 
there are extensive applications to the Theory of Vibrations, to 
Heat Conduction, to Hydrodynamics and Elasticity, and to certain
problems in Atomic and Nuclear Physics including cosmic ray showers. 
Other stochastic problems considered are the slowing down of neutrons 
and the phenomenon of turbulent fluid flow. The chapter on Elasticity 
is largely concerned with the theory of Griffith cracks. 

By discussing advanced problems in which he is personally interested 
from a physical point of view, the author has raised his work beyond 
the usual level of books on operational mathematics. On the other hand, 
it should be clear to the reader that the various chapters cannot 
always be regarded as balanced introductions to the subjects with 
which they are concerned (a case in point is the section on turbulence). 
Also, the mathematical introduction at the beginning is not in fact 
intended to be comprehensive and for more delicate questions reference 
should still be made to the book on Fourier integrals by E. C. Titch- 
marsh. However, it must be said that such limitations are almost in- 
evitable in a work of this nature. In the reviewer’s opinion the book 
represents a valuable addition to the literature and, incidentally, 
is a remarkable testimonial to the versatility of its author. 


A. Robinson

BRIEF SURVEYS


As a means of preventing excessive backlogs of reviews, we shall from time
to time earmark the books on hand for brief surveys.


Nomography and Empirical Equations. By Lee H. Johnson, John Wiley
and Sons, Inc., New York 1952, 150+ix pages. $3.75.


For some time a revision of Joseph Lipka’s book of 1918, Graphical
and Mechanical Computation, has been in order. To some extent, the
volume under review is such a revision; indeed, in the preface Dean
Johnson states that he has patterned this new work after certain
portions of Lipka’s book. The author is the Dean of the College of
Engineering of Tulane University, and this new book, like its
“predecessor”, is slanted toward engineering rather than toward
mathematical aspects. The preface states that a treatment of nomography
using determinants has been omitted, a fact which the reviewer regrets.
Also, the methods for determining constants in empirical equations are
discussed too briefly to have mathematical appeal.


Robert E. Greenwood 


Statistical Calculation for Beginners. By E. G. Chambers, Cambridge 
University Press, 1952, x+168 pages, $2.50. 

The first edition of this book was issued in 1940, and contained
viii+110 pages. It was reprinted several times before this second
(and enlarged) edition was issued. A review of the first edition may
be found in the Journal of the Royal Statistical Society, volume 104
(1941), page 64. This review closes with the sentence, “This book
should be of great help to the beginner”. The new material (on the
binomial distribution, Kendall’s rank correlation statistic, analysis
of variance and elementary curve fitting of empirical data) is
presented as simply as possible, according to the author’s preface. As
noted in the review of the first edition, the author’s choice of the
statistics discussed is biased in favor of statistics useful in
experimental psychology.


Robert E. Greenwood

MISCELLANEOUS NOTES


Edited by


Charles K. Robbins





Articles intended for this department should be sent to Charles K. Robbins, 
Department of Mathematics, Purdue University, Lafayette, Indiana. 








NUMBERS GAME* 





When Shakuntala Devi was five years old in Bangalore, India, she 
liked to sit at the side of her uncle, who was studying mathematics 
at the university. As a joke, he told the little girl about cube roots. 
One day, when Shakuntala wanted money to buy candy, she offered to 
help her uncle with a problem in cube root if he would finance the 
candy. When he laughed, she promptly wrote down the correct answer on 
a sheet of paper. 

From then on, Shakuntala (meaning “baby brought up by the birds”)
was a marked little Brahman. Her odd mathematical ability improved
rapidly with practice, and soon she was giving demonstrations all over
India. Later, she moved on to Europe, confounding mathematics experts
and makers of computing machines.

Last week sari-clad Shakuntala, now 20, gave a demonstration in 
Washington before a party of reporters and mathematics professors. 
She had become a master of the arm-long number. Without error or 
hesitation she extracted fourth, fifth and sixth roots of numbers up 
to ten digits. (Her record to date: extracting the 20th root of a 
42-digit number and multiplying figures that yielded a 39-digit result.)
Without hesitation, she worked out “magic squares” (horizontal, vertical 
and diagonal sums are identical), starting with random numbers suggested 
by the audience. 

Like most number prodigies, Shakuntala does not know how she does
it. She thinks about the problem and the numbers come, in three or
four seconds. Often she gives the answer as soon as her questioner has
written down the last digit. In the case of the root problems, the
answer must be a whole number. Her mysterious talent does not yield
uneven answers. She has studied logarithms, but they confused her and
she does not use them.

Mathematicians suggest that Shakuntala may have a fantastic memory, 
big enough to store all possible answers to all the tricks she offers. 
But this, they say, would be a startling feat in itself, probably 
as difficult as doing the computation in her head. Shakuntala herself, 
off on a cross-country tour of the U. S., just tries to avoid all
discussion for fear it will disturb her strange talent. Says she:
“I do not know my limits.”







































*Reprinted from Time Magazine, July 14, 1952.

LIQUOR IN EDUCATION


One hundred and twenty-three years ago, when good pure whiskey
was bought by the ton, school-children labored over problems like the
following: “Having a desire to collect a number of men for a frolick,
and wishing to keep them sober; I had a barrel containing 32 gallons
of whiskey out of which I took 5 gallons and replaced it with 5 gallons
of water; and finding it too strong, took out 5 gallons more putting
in 5 gallons of water; I demand the quantities of whisky and water
in the barrel.”
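The barrel problem is a repeated-dilution computation. The sketch below assumes, as the problem presumably intends, that the contents are thoroughly mixed before each drawing; the function name is hypothetical. Exact fractions are used so that no rounding enters.

```python
from fractions import Fraction


def dilute(capacity, draw, times):
    """Return (whiskey, water) in gallons after drawing off `draw` gallons of
    the well-mixed contents and refilling with water, `times` times over."""
    whiskey = Fraction(capacity)
    for _ in range(times):
        # The gallons drawn off carry away the same fraction of the whiskey.
        whiskey -= whiskey * Fraction(draw, capacity)
    return whiskey, capacity - whiskey


whiskey, water = dilute(32, 5, 2)
print(float(whiskey), float(water))  # 22.78125 9.21875
```

Each drawing leaves 27/32 of the whiskey, so two drawings leave 32 × (27/32)² = 729/32 gallons of whiskey and 295/32 gallons of water.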

The above problem is representative of numerous others found in
“Connolly’s Arithmetic” of 1829. The prevalence of such problems
shows, if text books reflect the spirit of the times in which they
are published, that Temperance Unions were not at all strong 100 or
so years ago. It also shows that drinking to the “frolicking” stage
was not considered a disgrace. The word “frolick” is no longer seen
in the society columns of the newspapers; perhaps the use of the word
went out with the serving of whiskey by the barrel, for even in these
times there would certainly be something resembling a frolic if a
barrel of whiskey were served at a party.

Examination of other Arithmetics of Connolly’s time and of earlier
times reveals that Connolly was not alone in demanding answers to
liquor problems. In “Daboll’s Schoolmaster’s Assistant”, published
in 1820, are found about 40 such problems, a number of which are
given below: I bought 3 hogsheads of brandy containing 61, 62, and
63½ gallons at $1.38 a gallon, I demand what they amount to.

A wine merchant bought 721 pipes of wine for $90,846; and sold 543 
pipes thereof for $89,649; how many pipes has he unsold and what do 
they stand him in? 
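Both of Daboll’s problems reduce to a few lines of arithmetic. The sketch below rests on two assumptions made here: the third hogshead is read as 63½ gallons, and “what they stand him in” is read as the merchant’s outlay not yet recovered by the sale. Decimal arithmetic keeps the dollar amounts exact.

```python
from decimal import Decimal

# Brandy: three hogsheads at $1.38 a gallon (third hogshead assumed to be 63.5 gallons).
gallons = Decimal("61") + Decimal("62") + Decimal("63.5")
brandy_cost = gallons * Decimal("1.38")

# Wine: pipes left unsold and the outlay not yet recovered by the sale.
pipes_unsold = 721 - 543
net_outlay = 90846 - 89649
cost_per_unsold_pipe = Decimal(net_outlay) / Decimal(pipes_unsold)

print(brandy_cost)
print(pipes_unsold, net_outlay, round(cost_per_unsold_pipe, 2))
```

On this reading the brandy amounts to $257.37, and the 178 unsold pipes stand the merchant in $1,197 altogether, or roughly $6.72 apiece.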

It might be added here that it takes two hogsheads to make a pipe
and two pipes or four hogsheads to make a ton; and the small amounts
which they “stand the merchants in” when selling such large quantities
may surprise the modern educator, but both Connolly and Daboll insist on
their respective title pages that their text book is “A plain, practical
system.”

A similar claim to practicality is made by Daniel Adams, M.D., in
his “Adams New Arithmetic”, copyrighted in 1827. This book with its
numerous problems touching on liquors is “calculated to give the pupil
a full knowledge to all the practical purposes of life”. Like the
other mathematicians of his time, Adams offers wine measure for measuring
all spiritous liquors, ale and beer excepted, and ale or beer measure
for ale, beer, and milk; and does not seem at all worried about
Temperance Unions.

However, another “New Arithmetic”, published in 1846 by Charles Davies,
LL.D., does show some diminution in the number of such problems, although
many of those given continue to deal with liquor in large quantities.
The following is a fair example of the kind given: Bought a hogshead
of brandy at $1.25 a gallon and sold it for $78; was there a loss or
gain?

Davies’ text book was one of the last to offer school children such
problems. This of course was due to the steadily growing movement
against the manufacturing and selling of spiritous liquor in the
country.


Isabel C. Sturm

Contact Mathematics Magazine, 14068 Van Nuys Blvd. Pacoima, Calif.











MATHEMATICS MAGAZINE 


Dear Mr. James 


I derive great pleasure and benefit from the reading of the expository
articles published in the Mathematics Magazine. In connection
with this matter I wish to make a suggestion which undoubtedly will
interest many readers of your Magazine. In the plastic stress analysis
of structures inequalities of the following type are common


in which $M_1, M_2, \ldots, M_n$ are the unknowns. Since textbooks on algebra
do not treat this subject adequately, there is a great need
for an algorithm for solving these inequalities by a successive elimination
of the unknowns. The article by L. L. Dines, “Systems of Linear
Inequalities”, Annals of Mathematics 1918-19, p. 191, is concerned with
existence theorems. In addition he uses determinants which are not
good in numerical computations. I hope therefore that you will publish
something in your Magazine in the near future.

Thanking you very much,
I am


Yours very truly 
A. Floris 





Correction to AN INVERSION OF THE LAMBERT TRANSFORM, “Mathematics 
Magazine,” 1950, p. 180. 


Dr. W. B. Pennington has pointed out to the author that the proof of
Theorem 3.3 assumes the continuity of a(t) at $t_0/k$, $k = 1, 2, \cdots$,
rather than at $t_0$ alone, as stated. A neater theorem would result
if a(t) were assumed continuous for all positive t.

PROBLEMS AND QUESTIONS
Edited by 


C. W. Trigg, Los Angeles City College 


Readers of this department are invited to submit for solution problems
believed to be new and subject-matter questions that may arise in study,
in research, or in extra-academic situations. Proposals should be
accompanied by solutions, when available, and by such information as will
assist the editor. Ordinarily, problems in well-known textbooks should not
be submitted.

Solutions should be submitted on separate, signed sheets. Figures should 
be drawn in India ink and twice the size desired for reproduction. Readers 
are invited to offer heuristic discussions in addition to formal solutions. 

Send all communications for this department to C. W. Trigg, Los Angeles 
City College, 855 N. Vermont Ave., Los Angeles 29, California. 


PROPOSALS 


140. Proposed by R. B. Herrera, Los Angeles City College. 


ABCDEF is a regular hexagon. The midpoints of sides AB, CD, 
DE, FA are joined respectively to points E, F, B, C forming a small 
rhombus at the center of the hexagon. Express the area of this rhombus 
in terms of a side of the hexagon. [This is problem 15 of the May 16,
1952 William B. Orange Mathematics Prize Competition.]


141. Proposed by P. A. Piza, San Juan, Puerto Rico. 


Let abcd, a > 0, be four-digit integers such that bc < 99 is a
multiple of 9 and a + d = 10. Prove that for n > 3 all integers
$(10^n - 1) \cdot abcd$ are palindromes.


142. Proposed by George Pate, Gordon Military College, Georgia. 


A lot contains n articles. If it is known that r of the articles 
are defective, and that the articles are inspected in random order, 
one at a time, what is the probability that the k-th article ($k \ge r$)
inspected will be the last defective one in the lot? 


143. Proposed by Leon Bankoff, Los Angeles, California. 


O is the center of the circumcircle of trapezium ADEC. AD and
CE produced meet in B. The circumcenters of triangles ABC and DBE
are O' and O'', respectively. Show that O'O equals the circumradius of
DBE and that O''O equals the circumradius of ABC.


144. Proposed by J. S. Cromelin, Clearing Industrial District, Chicago. 


The wind is blowing at 56 feet per second along the diagonal of a
square track, around which a plane and a car are traveling in opposite
directions. The plane is moving with a uniform air speed, the car with
a uniform ground speed. The plane first crosses the car at the windward
corner. The fourth crossing occurs at one o’clock. The seventh crossing
occurs at the leeward corner, and the eighth at ten minutes after two,
again at the windward corner. How large is the track?


145. Proposed by Leo Moser, University of Alberta, Canada.


It is well known that n = 14 is the smallest even integer for which
$\phi(x) = n$ is insolvable. Show for every positive integer r that
$\phi(x) = 2(7)^r$ is insolvable.


146. Proposed by A/3C D. L. Silverman, Lowry AFB, Colorado.


What is the locus of the center of a circle, radius r, which touches 
each of three mutually perpendicular planes? 


SOLUTIONS 
Problem to Frustrate a Modern Archimedes 


114. [Nov. 1951] Proposed by Dewey Duncan, East Los Angeles Junior 
College. 


(a) In air a gold-surfaced sphere weighs 7588 grams. It is known 
that it may contain one or more of the metals aluminum, copper, silver 
or lead. When weighed successively under standard conditions in water, 
benzene, alcohol, and glycerine its respective weights are 6588, 6488, 
§778, and 6328 grams. ''ow much, if any, of the forenamed metals does it 
contain, if the specific gravities of the designated substances are 
taken to be as follows? 


Aluminum    2.7     Alcohol     0.81
Copper      8.9     Benzene     0.90
Gold       19.3     Glycerine   1.26
Lead       11.3     Water       1.00
Silver     10.5


(b) If the ball is given a very thin coating of plastic, to prevent
amalgamating, how far will it sink when floating in mercury, specific
gravity 13.6?


Solution by W. H. Quelch, Florida State University, Tallahassee. 
(a) Let u, v, x, y, z be the volumes in cm³ of Al, Cu, Pb, Ag and Au,
respectively, contained in the sphere, which we assume to be not hollow.
Since the loss of weight in water (specific gravity 1.00) is 1000 grams,
the volume of the sphere is 1000 cm³. Then the data, some of which are
superfluous though consistent, lead to only two independent equations,
one relating volumes and the other, weights:


$u + v + x + y + z = 1000$


$2.7u + 8.9v + 11.3x + 10.5y + 19.3z = 7588$

Clearly the sphere must contain some aluminum to bring its mean specific
gravity below the specific gravities of all the other metals. There
is no unique result to this part of the problem, for the amounts of
three metals may be chosen arbitrarily, provided that the choices
will not result in negative amounts of any metal.

If the ball contains only aluminum and gold, there are 294.5 cm³
of gold and 705.5 cm³ of aluminum. Another possibility is 124.7 cm³
each of Cu, Au, Pb and Ag and 501.2 cm³ of Al.
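
The two particular mixtures quoted above can be checked numerically. The following is a minimal sketch, not part of the published solution; the function names are illustrative assumptions, and the densities are the specific gravities used in the weight equation.

```python
# Specific gravities (g/cm^3) taken from the weight equation of the solution.
DENSITY = {"Al": 2.7, "Cu": 8.9, "Pb": 11.3, "Ag": 10.5, "Au": 19.3}
TOTAL_VOLUME = 1000.0   # cm^3, from the 1000 g loss of weight in water
TOTAL_WEIGHT = 7588.0   # grams, weight in air


def weight(volumes):
    """Weight in grams of a mixture given as {metal: volume in cm^3}."""
    return sum(DENSITY[metal] * vol for metal, vol in volumes.items())


def aluminum_gold_only():
    """Solve u + z = 1000 and 2.7u + 19.3z = 7588; return (Al, Au) volumes."""
    z = (TOTAL_WEIGHT - DENSITY["Al"] * TOTAL_VOLUME) / (DENSITY["Au"] - DENSITY["Al"])
    return TOTAL_VOLUME - z, z


def equal_heavy_metals():
    """Equal volume t of Cu, Pb, Ag, Au plus aluminum; return (Al, t)."""
    heavy = sum(DENSITY[m] for m in ("Cu", "Pb", "Ag", "Au"))
    t = (TOTAL_WEIGHT - DENSITY["Al"] * TOTAL_VOLUME) / (heavy - 4 * DENSITY["Al"])
    return TOTAL_VOLUME - 4 * t, t


if __name__ == "__main__":
    al, au = aluminum_gold_only()
    print(f"Al and Au only: Al = {al:.1f} cm^3, Au = {au:.1f} cm^3")
    al2, t = equal_heavy_metals()
    print(f"Equal Cu, Pb, Ag, Au: {t:.1f} cm^3 each, Al = {al2:.1f} cm^3")
```

Both functions simply eliminate the aluminum volume from the pair of equations, so any other admissible choice of three metals could be tested the same way.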

(b) To find the distance, r + h, which the ball sinks beneath the
surface, we use the equation:


$500 + \pi \int_0^h (r^2 - x^2)\,dx = 7588/13.6,$

where $r = \sqrt[3]{750/\pi} = 6.203505$ cm. This leads to the cubic
$h^3 - 115.4504h + 55.3297 = 0$. The only root of this cubic inside
$\pm r$ is $h = 0.48021$. Therefore the bottom of the ball is 6.684 cm
below the surface of the mercury.
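
The depth in part (b) can be recomputed by solving the displacement equation directly. This is a sketch under the solution's own assumptions (a solid sphere of 1000 cm³ weighing 7588 grams, mercury of specific gravity 13.6); the function names and the use of bisection are illustrative choices, not the solver's method.

```python
import math

WEIGHT = 7588.0      # grams
VOLUME = 1000.0      # cm^3
MERCURY_SG = 13.6    # specific gravity of mercury, as given in the problem


def sphere_radius(volume=VOLUME):
    """Radius of a sphere of the given volume: r = cbrt(3V / (4 pi))."""
    return (3.0 * volume / (4.0 * math.pi)) ** (1.0 / 3.0)


def depth_of_center(weight=WEIGHT, volume=VOLUME, sg=MERCURY_SG):
    """Distance h of the sphere's center below the surface.

    Half the sphere plus a slab of thickness h above the center must
    displace weight / sg cm^3 of mercury.
    """
    r = sphere_radius(volume)
    target = weight / sg - volume / 2.0

    def excess(h):
        # Volume of the slab between the center plane and height h, minus target.
        return math.pi * (r * r * h - h ** 3 / 3.0) - target

    lo, hi = 0.0, r          # excess(lo) < 0 < excess(hi), and excess is increasing
    for _ in range(100):     # bisection
        mid = (lo + hi) / 2.0
        if excess(mid) < 0.0:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2.0


if __name__ == "__main__":
    r = sphere_radius()
    h = depth_of_center()
    print(f"r = {r:.6f} cm, h = {h:.5f} cm, depth of bottom = {r + h:.3f} cm")
```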

Also solved by W. B. Carver, Cornell University; F. F. Dorsey,
South Orange, N.J.; E. S. Keeping, University of Alberta; and the
proposer, who remarked that a large number of integer solutions are
given by the parametric equations u = 688 - 52a, v = 28 + 83a - 11b,
x = 11c, y = 13b - 10c, and z = 284 - 31a - 2b - c. The parameters are
to be given integer values which yield non-negative values for u, v,
x, y, z. Another single solution is (u, v, x, y, z) = (400, 300, 200,
40, 60).
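
A short check of the proposer's parametric family, offered as a sketch: it assumes the parametric equations read u = 688 - 52a, v = 28 + 83a - 11b, x = 11c, y = 13b - 10c, z = 284 - 31a - 2b - c, and it works in tenths of a gram so that all arithmetic stays in integers. The function names are hypothetical.

```python
def metals(a, b, c):
    """Volumes (u, v, x, y, z) of Al, Cu, Pb, Ag, Au for integer parameters."""
    u = 688 - 52 * a
    v = 28 + 83 * a - 11 * b
    x = 11 * c
    y = 13 * b - 10 * c
    z = 284 - 31 * a - 2 * b - c
    return u, v, x, y, z


def satisfies_both(volumes):
    """True if the volumes sum to 1000 cm^3 and the weight is 7588 g."""
    u, v, x, y, z = volumes
    tenths_of_gram = 27 * u + 89 * v + 113 * x + 105 * y + 193 * z
    return u + v + x + y + z == 1000 and tenths_of_gram == 75880


def nonnegative_solutions():
    """All members of the family with every volume non-negative."""
    found = []
    for a in range(0, 688 // 52 + 1):                 # keeps u >= 0
        for b in range(0, (28 + 83 * a) // 11 + 1):   # keeps v >= 0
            for c in range(0, 13 * b // 10 + 1):      # keeps y >= 0
                volumes = metals(a, b, c)
                if volumes[4] >= 0:                   # z >= 0
                    found.append(volumes)
    return found


if __name__ == "__main__":
    solutions = nonnegative_solutions()
    print(len(solutions), "non-negative integer solutions in the family")
    print("all valid:", all(satisfies_both(s) for s in solutions))
    print("(400, 300, 200, 40, 60) valid:", satisfies_both((400, 300, 200, 40, 60)))
```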


Peculiarity of Polyhedra 


115. [Nov. 1951] Proposed by Leo Moser, Texas Technological College. 


If all the faces of a polyhedron have central symmetry, prove that 
at least 6 of the faces are parallelograms. (A parallelopiped has exactly 
6 such faces.) 


Solution by the Proposer, University of Alberta. We use the notation
and results of S. S. Cairns, "Peculiarities of polyhedra," American
Mathematical Monthly, 58, 684-689, (Dec. 1951), wherein $V_i$ = the number
of vertices each belonging to exactly i edges, and $F_i$ = the number
of faces each having exactly i edges (i = 3, 4, 5, ...). Thus we have:


$V + F = E + 2$  (1)

$V = V_3 + V_4 + V_5 + \cdots$  (2)
$F = F_3 + F_4 + F_5 + \cdots$  (3)
$2E = 3F_3 + 4F_4 + 5F_5 + 6F_6 + \cdots$  (4)
$2E = 3V_3 + 4V_4 + 5V_5 + 6V_6 + \cdots$  (5)

Now multiply (1) by 6, and in it replace V and F by their values
from (2) and (3) and 6E by 2E from (4) and 2(2E) from (5). This yields

$3F_3 + 2F_4 + F_5 = 12 + (F_7 + 2V_4) + 2(F_8 + 2V_5) + 3(F_9 + 2V_6) + \cdots$.

Now a polygon having central symmetry has an even number of edges.
Hence for the polyhedra in question, $F_i = 0$ for i odd. Thus we have


$F_4 = 6 + (F_8 + V_4) + 2(F_{10} + V_5) + 3(F_{12} + V_6) + \cdots$


so that $F_4 \ge 6$ as required, since a quadrilateral having central
symmetry is a parallelogram.
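
The closing identity, $F_4 = 6 + (F_8 + V_4) + 2(F_{10} + V_5) + 3(F_{12} + V_6) + \cdots$, can be tested on a few familiar polyhedra whose faces all have central symmetry. The sketch below is an illustration only; the function name and the choice of test solids are assumptions, and the face and vertex counts are the standard ones for those solids.

```python
def predicted_quadrilateral_faces(faces, vertices):
    """Evaluate 6 + sum over k >= 1 of k * (F_{6+2k} + V_{3+k}).

    faces:    {edges per face: number of such faces}, all keys even
    vertices: {edges per vertex: number of such vertices}
    """
    total = 6
    for edges, count in faces.items():
        if edges % 2:
            raise ValueError("a centrally symmetric face has an even number of edges")
        if edges > 6:
            total += (edges - 6) // 2 * count   # F_8 counts once, F_10 twice, ...
    for degree, count in vertices.items():
        total += (degree - 3) * count           # V_4 counts once, V_5 twice, ...
    return total


# (faces, vertices) for polyhedra with centrally symmetric faces.
EXAMPLES = {
    "cube": ({4: 6}, {3: 8}),
    "hexagonal prism": ({4: 6, 6: 2}, {3: 12}),
    "octagonal prism": ({4: 8, 8: 2}, {3: 16}),
    "rhombic dodecahedron": ({4: 12}, {3: 8, 4: 6}),
    "truncated octahedron": ({4: 6, 6: 8}, {3: 24}),
}

if __name__ == "__main__":
    for name, (faces, vertices) in EXAMPLES.items():
        predicted = predicted_quadrilateral_faces(faces, vertices)
        print(f"{name}: F_4 = {faces[4]}, formula gives {predicted}")
```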


Also solved by William Moser, Student, University of Toronto.


A Simple Theorem Regarding the G.C.D. 


116. [Nov. 1951] Proposed by H. H. Berry, University of Kentucky. 


If a, b, and c are integers such that a + b = c and the least common
multiple of a and b is M, where ab/(a,b) = M, then (c,M) = (a,b).
[The symbol (a,b) represents the greatest common divisor of a and b.]


Solution by J. S. Shipman, Laboratory for Electronics, Inc., Boston,
Mass. Suppose first that (a,b) = 1. Then M = ab, and (c,M) = (c,ab).
Now c can have no factor in common with either a or b. For if c had a
factor in common with a, we could write c = c'd, a = a'd, d > 1, and
then b = c - a = d(c' - a'). Thus b has a factor d > 1 in common with
a, contrary to assumption. Similarly, (c,b) = 1. It follows that (c,ab)
= 1. Therefore, (c,M) = (c,ab) = 1 = (a,b).

Now suppose that (a,b) = d > 1, and write a = a'd, b = b'd, (a',b')
= 1. Then c = a + b = (a' + b')d = c'd. Further, M = a'b'd²/d = a'b'd.
It follows that (c,M) = (c'd, a'b'd) = d(c', a'b') = d, since
(c', a'b') = 1 by the first part applied to c' = a' + b'. Therefore
(c,M) = d = (a,b).
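
The theorem is easy to spot-check by brute force. The sketch below is only an empirical check over small positive integers, not a substitute for the proof; the helper names are illustrative.

```python
from math import gcd


def lcm(a, b):
    """Least common multiple M = ab / (a, b) of two positive integers."""
    return a * b // gcd(a, b)


def theorem_holds(a, b):
    """Check (c, M) == (a, b) where c = a + b and M = lcm(a, b)."""
    return gcd(a + b, lcm(a, b)) == gcd(a, b)


def counterexamples(limit):
    """Pairs 1 <= a, b <= limit for which the identity fails."""
    return [(a, b)
            for a in range(1, limit + 1)
            for b in range(1, limit + 1)
            if not theorem_holds(a, b)]


if __name__ == "__main__":
    print("counterexamples up to 60:", counterexamples(60))
```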

Also solved by A. L. Epstein, Geophysical Research Directorate,
Cambridge, Mass.; R. F. Reeves, Columbus, Ohio; and the proposer,
who remarked that this theorem is useful in solving such problems as
number 9, page 41, Uspensky and Heaslet, Elementary Number Theory.
Namely, “The sum of two numbers is 5,432 and their l.c.m. is 223,020.
Find the numbers.” By the above theorem, M = ab/(5432, 223020) = 223020
so ab = 28(223020). When this equation is solved simultaneously with
a + b = 5432, we obtain the numbers 1652 and 3780.
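
The proposer's worked example can be reproduced as follows. This is a minimal sketch: the function name is hypothetical, and it relies on the theorem to obtain the product ab from the sum and the l.c.m., after which a and b are the roots of a quadratic.

```python
from math import gcd, isqrt


def numbers_from_sum_and_lcm(total, m):
    """Return (a, b) with a + b = total and lcm(a, b) = m, or None.

    By the theorem, (a, b) = (total, m), so ab = (total, m) * m, and
    a, b are the roots of t^2 - total*t + ab = 0.
    """
    product = gcd(total, m) * m
    discriminant = total * total - 4 * product
    if discriminant < 0:
        return None
    root = isqrt(discriminant)
    if root * root != discriminant or (total - root) % 2:
        return None
    a, b = (total - root) // 2, (total + root) // 2
    # Confirm the candidate pair really has the required l.c.m.
    if a <= 0 or a * b // gcd(a, b) != m:
        return None
    return a, b


if __name__ == "__main__":
    print(numbers_from_sum_and_lcm(5432, 223020))
```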


A Triangular Number Relationship 


119. [January 1952] Proposed by P. A. Piza, San Juan, Puerto Rico.
Solve $t_x t_{x+1} t_{x+2} = 468 t_y^3 + 468$ for x and y. [$t_a = a(a + 1)/2$.]


Partial solution by Leon Bankoff, Los Angeles, Calif. Since
$R = 468(t_y^3 + 1) = 4 \cdot 9 \cdot 13\{[y(y + 1)/2]^3 + 1\}$, inspection
reveals that R can terminate only in 4, 6, or 8. Now
$L = t_x t_{x+1} t_{x+2} = x(x + 1)^2 (x + 2)^2 (x + 3)/8$, so L can
terminate only in 0 or 8. But L = R, so both terminate in 8 and
$x \equiv 1 \pmod{5}$. Furthermore, $L \equiv 0 \pmod{13}$, so
$x \equiv 10, 11, 12,$ or $0 \pmod{13}$, whereupon
$x \equiv 36, 11, 51,$ or $26 \pmod{65}$. We proceed to test the possible
values of x and immediately find that for x = 11,
$L = 11(12)^2(13)^2 14/8 = 468[(4 \cdot 5/2)^3 + 1] = R$, so
y = 4. There are no other solutions for x < 352.
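
The search described in the partial solution can be repeated by machine. The sketch below assumes the equation reads $t_x t_{x+1} t_{x+2} = 468(t_y^3 + 1)$, the form consistent with the arithmetic for x = 11, y = 4, and tests every positive x below a bound directly rather than using the congruences; the function names are illustrative.

```python
from math import isqrt


def tri(a):
    """Triangular number t_a = a(a + 1)/2."""
    return a * (a + 1) // 2


def integer_cube_root(n):
    """Largest integer r with r**3 <= n, for n >= 0."""
    r = round(n ** (1.0 / 3.0))
    while r ** 3 > n:
        r -= 1
    while (r + 1) ** 3 <= n:
        r += 1
    return r


def triangular_index(t):
    """Return y >= 1 with tri(y) == t, or None if t is not such a number."""
    s = isqrt(8 * t + 1)
    if s * s != 8 * t + 1:
        return None
    y = (s - 1) // 2
    return y if y >= 1 else None


def solutions(bound):
    """All (x, y) with 1 <= x < bound solving t_x t_{x+1} t_{x+2} = 468(t_y^3 + 1)."""
    found = []
    for x in range(1, bound):
        left = tri(x) * tri(x + 1) * tri(x + 2)
        if left % 468:
            continue
        cube = left // 468 - 1          # must equal t_y ** 3
        root = integer_cube_root(cube)
        if root ** 3 != cube:
            continue
        y = triangular_index(root)
        if y is not None:
            found.append((x, y))
    return found


if __name__ == "__main__":
    print(solutions(352))
```

Raising the bound passed to `solutions` extends the search past the range covered in the published partial solution.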

Also partially solved by the proposer.

Editorial note: A general solution of this problem, or a proof 
that the given solution is unique will be welcome. 


Equilateral Triangles Associated with a Rectangle 


120. [January 1952] Proposed by Victor Thébault, Tennie, Sarthe,
France.


On the sides CB and CD of rectangle ABCD construct internally
(or externally) equilateral triangles CEB and DFC. Show that triangle
AEF is equilateral.


I. Solution by G. W. Courter, Baton Rouge, Louisiana. We generalize
the proof to the parallelogram ABCD. In the external case,
∠FCE = (180° - ∠ABC) + 60° + 60° = 300° - ∠ABC, ∠ABE = 360°
- (∠ABC + 60°) = 300° - ∠ABC, and ∠FDA = 360° - (∠ADC + 60°)
= 300° - ∠ABC.

In the internal case, ∠FCE = 60° + 60° - (180° - ∠ABC) = ∠ABC
- 60°, ∠ABE = ∠ABC - 60°, and ∠FDA = ∠ADC - 60° = ∠ABC - 60°.

Hence, in both cases, ∠ABE = ∠FCE = ∠FDA. Now AB = CD = FC
= FD and BE = CE = BC = DA. It follows that triangles ABE, FCE and
FDA are congruent. Whereupon AE = EF = FA and triangle AEF is
equilateral.

The proof applies to the figures, where ∠ABC > 120°. When 60°
< ∠ABC < 120°, ∠ABE = ∠FCE = ∠FDA = ∠ABC + 60°; when ∠ABC
< 60°, ∠ABE = ∠FCE = ∠FDA = 60° ± ∠ABC, where the positive
sign applies to the external case and the negative sign to the internal
case. The remainder of the proof proceeds as before.
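
For the rectangle of the original problem, the result can be confirmed with coordinates. This is a numerical sketch only: it places A = (0, 0), B = (w, 0), C = (w, h), D = (0, h), which is an assumed coordinate choice, and the function name is hypothetical.

```python
import math


def triangle_aef_sides(width, height, internal=True):
    """Side lengths (AE, EF, FA) for equilateral triangles CEB and DFC.

    Rectangle: A = (0, 0), B = (width, 0), C = (width, height), D = (0, height).
    E is the apex on side CB and F the apex on side CD, drawn toward the
    inside of the rectangle when internal is True, otherwise outward.
    """
    sign = -1.0 if internal else 1.0
    half_root3 = math.sqrt(3.0) / 2.0
    ax, ay = 0.0, 0.0
    ex, ey = width + sign * half_root3 * height, height / 2.0
    fx, fy = width / 2.0, height + sign * half_root3 * width
    ae = math.hypot(ex - ax, ey - ay)
    ef = math.hypot(fx - ex, fy - ey)
    fa = math.hypot(ax - fx, ay - fy)
    return ae, ef, fa


if __name__ == "__main__":
    for internal in (True, False):
        sides = triangle_aef_sides(3.0, 2.0, internal)
        label = "internal" if internal else "external"
        print(label, ["%.6f" % s for s in sides])
```

Expanding the squared distances by hand gives w² + h² ∓ √3·wh for all three sides, which is what the numerical check reflects.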























II. Solution by C. D. Smith, University of Alabama. The problem
is a special case of a more general configuration. Consider m and n,
the respective perpendicular bisectors of the sides AB and BC of the
rectangle ABCD. Choose F on m and describe a circle (F) with radius
FA and cutting n in E and E', E nearer CB. So (F) is the circumcircle
of isosceles trapezoid ABEE'. Then we have a system of isosceles
triangles: EBC congruent to E'AD, AEF congruent to BE'F, AE'F congruent
to BEF, and FCD associated with the system.

Now let ∠ABE = x. It follows that ∠BEC = 2x = ∠AFE. This
makes triangles EBC and AEF similar in general. Triangle FCD is
isosceles but not generally similar to BEC. When x = 30°, triangle
EBC is equilateral (and hence triangle AEF is also) so CE = CB.
Consequently, CF is the perpendicular bisector of BE, which makes
∠FCD = 60° and triangle FCD equilateral. This proves the interior
case.

When E moves along n until it falls outside the rectangle and
x = 150°, then triangles EBC and AEF are equilateral. Again CF is
the perpendicular bisector of BE, so ∠FCD = 60° and triangle FCD
is equilateral. This proves the exterior case.

The general configuration is a problem of three circles on point E 
and determined by the vertices A, B; B, C; and A, D of the rectangle. 
For a discussion of three-circle problems with a triangle of reference, 
see National Mathematics Magazine, 14, 299-307, (March 1940). 

Also solved by Leon Bankoff, Los Angeles, Calif.; F. F. Dorsey,
South Orange, N.J.; A. L. Epstein, Cambridge Research Center, Mass.;
Mary Hanania, Beirut College for Women, Lebanon; R. E. Jackson, Student,
University of California at Los Angeles; Samir Khabbaz, Bethel College,
Kansas; M. S. Klamkin, Polytechnic Institute of Brooklyn; R. R. Phelps,
Student, Los Angeles City College; Charles Salkind, Polytechnic
Institute of Brooklyn; Perry Seagle and N. D. Stanton, Students, Winthrop
College, S.C.; R. P. Smith, Student, Drake University, Iowa; and the
proposer.


QUICKIES 


From time to time this department will publish problems which may be solved 
by laborious methods, but which with the proper insight may be disposed of 
with dispatch. Readers are urged to submit their favorite problems of this 
type, together with the elegant solution and the source, if known. 


“Q 59. [March 1952] Across one corner of a rectangular room two
4-foot screens are placed in such a manner as to enclose the maximum
floor space. Determine their positions.” Charles Salkind offers this
alternative solution. The two screens and the two walls determine
an isosceles triangle and a right triangle, the hypotenuse of which is
the base of the isosceles triangle. The maximum right triangle with a
given hypotenuse is isosceles. Thus we have two isosceles triangles
with vertices on opposite sides of their common base. The second
diagonal of the quadrangle thus formed bisects the right angle. Now we
have two triangles each with a fixed 4-foot base and a vertex angle of
45°. To be a maximum each of these triangles must be isosceles with
remaining angles of 67½°, these being the angles made with the walls
by the screens.
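
The 67½° answer can be checked by a coarse numerical search. The sketch below is an illustration under assumed conventions: the corner is the origin, the walls are the positive axes, and each screen's angle is measured between the screen and its wall on the enclosed side. The names and the half-degree grid are arbitrary choices.

```python
import math

SCREEN = 4.0  # length of each screen in feet


def enclosed_area(alpha_deg, beta_deg):
    """Floor area enclosed when the screens meet the walls at these angles.

    Returns None when the configuration would put a screen foot on the
    wrong side of the corner.
    """
    a = math.radians(alpha_deg)
    b = math.radians(beta_deg)
    mx = SCREEN * math.sin(b)            # meeting point of the two screens
    my = SCREEN * math.sin(a)
    p = mx + SCREEN * math.cos(a)        # foot of the first screen, on the x-axis wall
    q = my + SCREEN * math.cos(b)        # foot of the second screen, on the y-axis wall
    if p < 0.0 or q < 0.0:
        return None
    # Shoelace formula for the quadrilateral (0,0), (p,0), (mx,my), (0,q).
    return 0.5 * (p * my + mx * q)


def best_angles():
    """Search a half-degree grid and return (alpha, beta, area) of the maximum."""
    best = (0.0, 0.0, -1.0)
    for i in range(1, 360):
        for j in range(1, 360):
            area = enclosed_area(i / 2.0, j / 2.0)
            if area is not None and area > best[2]:
                best = (i / 2.0, j / 2.0, area)
    return best


if __name__ == "__main__":
    alpha, beta, area = best_angles()
    print(f"alpha = {alpha} deg, beta = {beta} deg, area = {area:.4f} sq ft")
```

With both angles equal to θ the area reduces to 8(1 - cos 2θ + sin 2θ), which is greatest when 2θ = 135°, in agreement with the geometric argument.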





Q 67. Find a solution in positive integers of $a^3 + b^4 = c^5$.
[Submitted by Leo Moser.]


Q 68. A courier rode from the rear of a column of marching soldiers
to the front and returned forthwith to the rear of the column. He kept
his horse jogging along exactly three times as fast as the column itself
was advancing. Where on the road, with reference to the original
position of the vanguard, did he complete his journey? [Monte Dernham in
THE BAT, No. 63, page 472, March 1949.]


Q 69. The area of the surface bounding the space common to two circular
cylinders of unit radii whose axes meet at right angles is 16.
[Submitted by J. H. Butchart.]


N N 
Q 70. Find the maximum value of > 0,%, i: 48,* &).
n=1 n=1
[Submitted by M. S. Klamkin.]